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This  study  investigated  the  efficacy  of  selection  procedures 
associated  with  the  Armed  Service  Vocational  Aptitude  Battery 
(ASVAB).  The  main  hypothesis  tested  was  whether  the  Armed 
Forces  Qualification  Test  (AFQT),  an  ASVAB  composite,  is  a  valid 
predictor  of  training  success.  Additional  hypotheses  investigated 
whether  the  AFQT  is  a  more  valid  predictor  of  training  success 
than  the  individual  career-specific  selector  composites.  A  final 
hypothesis  dealt  with  the  expectation  that  the  AFQT  would  show  a 
larger  improvement  over  selector  composites  in  validities  for 
females  than  for  males. 

Results  indicate  the  AFQT  has  high  predictive  validity  for 
all  military  occupations  included  in  the  sample.  Differential 
validity  is  observed  to  favor  three  of  the  four  ASVAB  composites 
(Mechanical,  General,  and  Electronic).  For  the  fourth  composite, 
(Administrative),  the  AFQT  has  higher  validity,  and  the  increment  __ 
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does  favor  females.  Two  possible  explanations  for  the  findings 
are  discussed.  The  need  for  continued  research  is  indicated  .a 
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INTRODUCTION 


As  a  personnel  selection  instrument,  the  Armed  Service 
Vocational  Aptitude  Battery  (ASVAB)  is  a  vital  component  in  tFie 
maintenance  of  a  quality  military  force.  By  investigating  the 
method  in  wliich  ASVAB  scores  can  be  applied  in  seleci.ion 
procedures,  this  study  attempts  to  aid  the  continual  research 
efforts  to  maintain  the  ASVAB  as  a  state-of-the-art  selection 
instrument . 

To  elucidate  the  scope  of  this  research  problem,  a  number 
of  issues  involved  in  personnel  selection  are  discussed, 
including  the  concept  of  g,  and  gender  differctices  in  ability 
measures.  The  ASVAB  is  then  described  in  terms  of  its  history, 
content,  reliability,  and  validity.  Lastly,  validity 
generalization  and  meta-analysis  are  discussed,  as  the  basis 
for  t!iis  study. 

History  of  Per sonnel  Selection 

The  issues  surrounding  tlie  topic  of  personnel  selection 
are  much  more  complex  than  would  initially  he  assumed  by  a  casual 
ol)serv(’r.  A  liistorical  review  is  rof|uired  before  delving  into 
this  many-faceted  topic. 

Measurement  of  Iniman  abilities  has  long  l)een  of  interest  to 
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scientists.  Sir  Francis  Galton  set  up  an  anthropometric 
laboratory  in  the  late  18th  century.  He  measured  sensory  acuity 
and  reaction  Limes  of  thounntuls  of  subjects,  believing  that  these 
types  of  abilities  gauged  the  level  of  an  individual's  intellect 
(Anastasi,  1982).  More  refined  measures  of  intelligence  began 
principally  with  Binet,  who  in  1905  developed  an  intelligence 
test  based  on  what  he  felt  were  the  essential  components  of 
intelligence.  He  believed  intellect  consists  of  more  abstract 
faculties,  such  as  judgement  and  comprehension.  Binet 's 
intelligence  test  was  developed  for  the  purpose  of  identifying 
the  educable  mentally  retarded  children  in  Paris  schools  (DuBois, 
1970). 

The  first  group  test  used  to  assess  abilities  particularly 
for  the  purpose  of  personnel  selection  was  developed  during  World 
War  I,  by  a  team  of  psychologists  headed  by  Robert  Yerkes  (Graham 
&  Lilly,  198A).  The  resulting  test  called  the  Army  Alpha  was 
the  first  multiple  choice,  objectively  scorable,  group 
administerable  test  devised.  As  such,  it  provided  the  military 
v/ith  a  highly  efficient  means  of  measuring  cognitive  ability,  or 
intelligence,  of  World  War  I  recruits.  Though  the  Army  Alpha  was 
develoi^cd  to  reflect  a  measure  of  general  Intelligence,  it  did 
contain  individual  subtests  (specifically.  Oral  Directions, 
Arithmetrical  Reasoning,  Practical  Judgement,  Synonym-Antonym, 
Disarranged  Sentences,  Number  Series  Completion,  Analogies,  and 
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Information ) . 

At  that  time  in  the  history  of  psychological  testing,  "the 
prevailing  opinion  was  that  man  possesses  a  single,  all-important 
intellectual  ability"  (Ghiselli,  1973).  Charles  Spearman  (1904, 
1927)  was  a  leading  pioneer  in  the  investigation  of  the  concept 
of  intellectual  ability.  Through  his  invention  of  the  procedure 
of  factor  analysis,  Spearman  found  evidence  for  his  Two-Factor 
Theory  of  intelligence.  His  factor  analytic  procedures  identified 
intercorrelations  among  tests  of  intellectual  ability.  He 
defined  the  commonality  among  the  tests  as  representing  general 
intelligence,  labeled  g,  whicli  composed  the  first  factor  of  his 
theory.  The  second  factor  of  his  theory,  s,  represented  the  many 
factors  specific  to  each  test  or  type  of  test. 

Faith  in  the  existence  of  Spearman's  g  faded  following 
Thurstone's  (1938)  application  of  liis  centroid  factor  method. 
Thurstone  did  not  locate  g,  or  any  common  factor,  in  a  battery 
of  56  tests.  Upon  this  evidence,  Thurstone  developed  his  theory 
of  Primary  Mental  Abilities,  which  claimed  that  seven  unique 
factors  exist,  each  representing  a  separate  mental  ability,  and 
that  no  single  index  could  satisfactorily  explain  the  concept  of 
intellectual  ability. 

Thurstone's  rejection  of  the  existence  of  g  did  not  stand 
for  long.  Spearman  reviewed  Thurstone's  procedures,  and  by 
reanalyzing  the  data,  did  locate  tlie  general  factor.  In  addition, 


A 


when  Thurstone  attempLed  to  devise  tests  to  measure  liis  primary 
factors,  he  found  that  the  primaries  were  intercorrelated ,  rather 
than  independent  as  he  had  hypothesized  (McNemar,  196A). 

Though  Thurstonc  devoted  many  years  to  the  task  of  searching 
for  pure  measures  of  distinct  abilities,  he  was  unable  to  produce 
tests  wliicli  remained  uncorrelated  with  one  anotlier  when  based  on 
a  large,  representative  population  (Jenson,  1986).  'Miis 
phenomenon,  known  as  Positive  Manifold,  shows  that  mental  ability 
tests  with  adequate  reliabilities,  when  administered  to 
representative  populations,  will  always  result  in  (ositive 
intercorrelations.  Thurstone  finally  admitted  that  a  general 
factor  was  required  to  explain  the  intercorrelations  among  his 
primary  factors  (Thurstone  Thurstone,  19A1). 

Despite  the  preliminary  evidence  in  support  of  Spearman's 
g,  the  general  movement  in  test  development  began  to  lean  toward 
measurement  of  specific  abilities,  dowTiplaying  the  concept  of  g. 

Between  World  Wars  1  and  II,  psychologists  liypotliesi zed  that 
batteries  of  tests  measuring  specific  aptitudes  could  produce  a 
composite  score  that  would  comprise  an  a|)tifiKle  test  tailored  to 
a  particular  job  or  curricu'um.  Hunter  (198A)  stated  this 
hypothesis  was  not  tested  until  recently,  and  fails  in  all  btit  a 
lew  si'ccial  cases.  Though  untested,  the  concept  of  differential 
aptitude  testing  became  particularly  ap|)ealing  to  the  military, 
due  to  rapid  advances  in  militarv  teclinologv.  During  World  War 
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.11,  many  leading  psychometricians  applied  multiple  regression 
strategies  to  aptitude  batteries  in  order  to  optimally  predict 
performance  in  technical  military  career  fields.  Multiple  or 
differential  aptitude  batteries  were  employed  in  the  civilian 
sector  as  veil. 

Validation  results  of  sSuch  batteries  have  led  to  an 
emphasis  on  quantitative  and  verbal  abilities,  and  also  on 
technical  aptitudes  particularly  for  military  batteries.  The 
three  most  commonly  used  differential  aptitude  batteries  at 
present  are:  the  ASVAB,  developed  by  the  Department  of  Defense 
(DoD),  (U.S.  DoD,  198A);  the  General  Aptitude  Test  Battery 
(OATD,  developed  by  the  U.S.  Employment  Service  (USES,  1970); 
and  the  Differential  Aptitude  Test  (DAT),  published  by  The 
Psychological  Corporation. 

The  ASVAB 

Tl>e  ASVAB  was  developed  in  the  19b0's  for  the  purpose  of 
.selection  and  classification  of  United  States  military  personnel. 
Previously,  the  Army,  Navy,  Marines,  and  Air  Force  used  test 
batteries  that  were  developed  separately  by  each  service. 

The  Selccrtivc  Service  Act  of  19A8  led  to  the  development  of 
the  Armed  Forces  (Qualification  Tc.nl  (AFQT),  the  goal  of  which  was 
to  promote  a  more  equitable  di.stributiori  of  abilities  among  the 
various  services.  The  AFQT  w-as  developed  in  a  joint- .services 
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project,  for  administration  to  all  military  applicants. 
Standardization  of  the  AFQT  against  the  Army's  entrance  exam, 
the  Army  Genera]  Classification  Test  (AGCT)  utilized  test  scores 
of  all  military  enlisted  personnel  as  of  December  31,  19AA.  This 
group  became  a  reference  population  for  subsequent  AFQT  versions. 
The  AFQT  has  since  been  used  to  compare  the  distribution  of 
abilities  of  accessions  among  the  services,  to  screen  applicants 
for  enlistment,  and  to  report  accession  quality  to  the  U.S. 
Congress. 

In  1958,  the  Airman  Qualifying  Examination  (AQE)  was 
introduced  into  high  schools  by  the  Air  Force  for  the  purpose  of 
assisting  in  occupational  counseling.  Then  in  1966,  DoD 
initiated  development  of  a  single  test  to  be  used  for  selection 
and  classification  by  all  services.  This  joint-services  project 
resulted  in  the  ASVAB.  Form  1  of  the  ASVAB  replaced  the  AQE  for 
high  school  use  in  1968,  and  it  was  replaced  by  alternate  form  2 
in  1973.  Alternate  form  3  was  used  for  Air  Force  recruiting 
beginning  in  1973,  and  by  the  Marines  in  1975. 

Early  successes  of  the  ASVAB  led  to  its  use  as  the  sole 
instrument  for  selection,  classification,  and  assignment  of  all 
service  personnel.  In  1976,  ASVAB  form  5  was  provided  to  high 
schools,  while  parallel  forms  6  and  7  were  provided  to  Military 
Entrance  Processing  Stations  (MEPSs),  thus  replacing  the 
batteries  used  by  the  individual  services. 
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The  Air  Force  Human  Resources  Laboratory  (AFHRL)  is 
currently  the  lead  laboratory  responsible  for  continuing  ASVAB 
research,  development,  and  validation.  ASVAB  forms  8,  9,  and  10, 
which  were  developed  to  provide  more  accurate  measures  at  lower 
levels  of  ability,  were  placed  in  use  in  October,  1980. 
Replacement  forms  11,  12,  13,  and  lA,  developed  to  be  parallel  to 
forms  8,  9,  and  10,  began  use  in  October,  198A,  and  are  the  forms 
currently  used. 

The  1980  Profile  of  American  Youth,  a  DoD-sponsored  study, 
provided  a  more  current  reference  population  against  which  ASVAB 
scores  could  be  interpreted.  A  nationally  representative  sample 
of  about  12,000  men  and  women,  aged  16  to  23,  took  form  8AX 
(identical  to  form  8A)  of  the  ASVAB.  Bock  and  Mislevy  (1981) 
provide  supporting  data  as  to  the  representativeness  of  the 
sample,  and  the  accuracy  of  the  test  and  procedures  used. 

The  content  of  the  ASVAB  forms  8  through  lA  consists  of  10 
subtests;  General  Science  (GS),  Arithmetic  Reasoning  (AR), 

Word  Knowledge  (WK),  Paragraph  Comprehension  (PC),  Numerical 
Operations  (NO),  Coding  Speed  (CS),  Auto  and  Shop  Information 
(AS),  Mathematics  Knowledge  (MK) ,  Mechanical  Comprehension  (MC), 
and  Electronics  Information  (El).  The  subject  areas  have  been 
chosen  according  to  their  abilities  to  predict  success  in 
military  training  courses. 

The  current  AFQT  used  by  all  services  consists  of  a 
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composltp  of  four  ASVAB  subtests  (AR,  WK,  PC,  and  a  half- 
wclRhtinp,  of  NO;  see  Table  1).  The  AFQT  is  used  for  initial 
selection  or  rejection  of  applicants.  Each  of  the  services 
employs  additional  subtest  conliguratioi^s  for  classification 
purposes.  The  Air  Force  utilizes  four  such  composites; 

Mechanical  (M),  Administrative  (A),  General  (G),  and  Electronic 
(i:).  (See  Table  1  for  compositions  of  these  composites.)  The 
composites,  or  aptitude  indexes  (AIs),  were  developed  through 
multiple  regression  procedures  (Hunter,  Crosson,  &  Friedman, 
19H5). 

In  the  recruitment  process,  an  applicant  is  administered  a 
current  form  of  tlte  ASVAB.  His/her  AFQT  score,  if  sufficiently 
high,  quallfiee  the  Itidlvidual  for  enlistment.  The  AI  scores 
me  lltofi  used,  in  addition  to  manning  recpilrements  and  the 
efilJstee's  prefcroncon,  to  place  the  Individual  in  a  training 
program  for  a  specific  job,  or  in  a  career  field  following 
succoHfiful  cornploLion  oi  Basic  Mililary  Training.  Some  highly 
flclecllvt!  Air  Force  career  fields  employ  higher  AI  cutoff  scores 
tlinn  others. 

Studies  of  ASVAB  reliabilities  have  yielded  results 
Indlcatliig  that  the  ASVAB  subtests  have  satisfactory  reliability. 
Hee,  Mullins,  Mathews,  and  Massey  (1982)  computed  internal 
consInlrMicy  i  eliabi  1  itles  for  the  eight  power  subte.sts  for  ASVAB 
Ha,  Hli,  9a,  9b,  lOu,  and  IQh.  The  average  subLest  reliabilities 


Table  1 


ASVAB  Subtest  N  of 

'  Items 

AFQT 

M^ 

A*’ 

g' 

General  Science  (GS) 

2S 

X 

Arithmetic  Reasoning  (AR) 

30 

X 

X 

Word  Knowledge  (WK) 

35 

X 

X 

X 

Paragraph  Comprehension  (PC) 

15 

X 

X 

X 

G  f 

Numerical  Operations  (NO) 

50 

X 

X 

Coding  Speed  (CS)^ 

8A 

X 

Auto  and  Shop  Information  (AS)® 

25 

X 

1 

Mathematics  Knowledge  (MK) 

25 

1 

Mechanical  Comprehension  (MC) 

25 

1 

X 

Electronics  Information  (El) 

20 

M  =  Mechanical  Composite 
A  =  Administrative  Composite 
G  =  General  Composite 
E  =  Electronic  Composite 

NO  and  CS  are  speeded  sublests.  AIL  others  are  puwer  aubtests, 
NO  is  weighted  at  ^  for  AFQT  inclusion. 

AS  is  double-weighted  for  inclusion  in  the  M  composite. 
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across  all  forms  ranged  from  .81  to  .92.  Reliabilities  of  the 
two  speeded  subtests  (NO  and  CS)  were  indirectly  inferred  from 
subtest  intercorrelations,  which  were  observed  to  range  from 
.53  to  .70  (U.S.  DoD,  1984). 

Hunter,  Crosson,  and  Friedman  (1985)  reported  reliabilities 
for  all  10  subtests  as  computed  by  extrapolation  from  previous 
data  (Friedman,  Streicher,  Wing,  Grafton,  &  Mitchell,  1983;  Kass, 
Mitchell,  Grafton,  &  Wing,  1982).  Hunter  and  his  associates 
utilized  a  reliability  theory  formula  which  forecasts  estimates 
from  one  population  to  another  which  has  a  different  variance, 
thereby  attempting  to  show  what  the  ASVAB  form  8,  9,  and  10 
subtest  eorrelations  would  have  been,  were  there  no  error  of 
measurement.  Correeted  reliabilities  thus  eomputed  ranged  from 
.66  to  .85. 

Although  internal  consistency  reliabilities  provide  useful 
information,  parallel-forms  (or  alternate-forms)  reliabilities 
are  preferred.  Parallel-forms  methods  correspond  to  the  classic 
definition  of  reliability  as  the  ratio  of  true-score  variance  to 
observed-score  variance.  Such  methods  avoid  the  possible  bias 
caused  by  time  effects,  as  encountered  in  test-retest  methods. 
Parallel-forms  methods  also  may  be  used  with  quickly-paced  tests 
(such  as  the  two  speeded  subtests  of  the  ASVAB,  NO  and  CS), 
whereas  internal  consistency  methods  will  overestimate  the 


reliabilit jes  of  such  tests. 
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Palmer,  Hartke,  Ree,  Welsh,  and  Valentine  (1988)  computed 
parallel-forms  reliabilities  for  subtests  and  composites  of 
ASVAB  forms  8,  9,  10,  and  11.  (The  results  apply  to  forms  8 
through  lA,  os  all  these  forms  ore  parallel ).  Their  results, 
based  on  data  from  a  sample  of  75,000  armed  service  applicants, 
indicate  that  for  each  subtest,  reliabilities  are  similar  acros.'? 
all  forms.  The  coefficients  observed  for  the  subtests,  across 
all  forms,  ranged  from  .67  to  .88.  The  composite  coefficients 
(including  the  AIs  and  the  AFQT)  ranged  from  .87  to  .93. 
Reliability  coefficients  were  observed  to  be  higher  in  general 
for  males  and  whites  than  for  females,  blacks,  or  hispanics, 
although  the  reliabilities  for  females,  blacks,  and  hispanics 
remained  adequate.  Reliability  coefficients  for  females  ranged 
from  .56  to  .92;  for  blacks,  from  .83  to  .90;  for  hispanics, 
from  .80  to  .90. 

This  compilation  of  studies  indicates  stable,  satisfactory 
levels  of  reliability  across  ASVAB  forms,  subtests,  and 
composites. 

The  statistical  validity  of  the  ASVAB  is  an  estimation  of 
how  useful  it  is  as  a  tool  to  predict  job  performance  in  military 
career  fields.  Job  performance  measures  have  not,  however, 
proven  to  be  available  criteria  for  estimating  ASVAB  validities, 
as  tliere  are  no  measures  which  can  be  commonly  applied  across 
all  military  occupations.  However,  technical  training  course 
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f’radcs  have  been  used  for  several  decades. 

ASVAB  validity  coefficients  are  computed  by  correlating  the 
test  scores  with  training  school  performance  measures.  Training 
school  grade.s  provide  objective  measures,  and  are  obtainable 
across  all  occupations.  Furtliermore,  the  content  of  training 
programs  is  established  according  to  job  performance  requirements. 
Within  tlic  Air  Force,  occupational  analyses  based  on  the 
Instructional  Systems  Development  (ISD)  process  are  performed  for 
each  career  field.  These  analyses  utilize  the  Comprehensive 
Occupatlotial  Data  Analysis  Program  (CODAP)  to  ensure  that 
training  school  content  is  based  upon  required  job  knowledge 
(Stacey,  Weismuller,  Barton,  &  Rogers,  197^).  Hunter  (198^)  has 
staled  that  job  knowledge  and  job  performance  measures  are  very 
highly  correlated: 

Cognitive  ability  proved  to  be  a  very  good  predictor  of  job 
knowledge  and  is,  thus,  a  good  predictor  of  job  performaice.. 
To  a  lesser  extent,  cognitive  ability  also  predicts  job 
performance  directly  ....  data  using  work-sample  tests 
show  tlwit  there  is  a  very  high  correlation  between  job 
knowledge  and  .job  performance.  People  who  do  not  know  much 
Hl)out  the  job  will  perform  poorly.  The  multiple  regression 
of  work  saiii|)lG  performance  onto  ability  and  knowledge  shows 
that  it  is  job  knowledge  which  has  the  larger  direct  impact 
on  performance,  (p  52  and  53) 
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Thus  not  only  do  job  knowledge  measures  accurately  predict  job 
performance,  but  cognitive  ability  measures,  such  as  technical 
school  grades,  are  highly  appropriate  for  assessing  job  knowledge. 

Each  of  the  military  services  conducted  validation  studies 
for  ASVAB  forms  8,  9,  and  10,  in  1983.  Detailed  results  and 
conclusions  from  these  studies  are  presented  in  the  ASVAB  Test 
Manual  (U.S.  DoD,  1984).  For  all  services,  reported  validities 
are  sufficiently  strong  to  predict  training  success.  For  Air 
Force  data,  no  major  validity  differences  were  seen  between 
black  and  white  or  male  and  female  subgroups,  for  the  specialties 
which  had  adequate-sized  samples  and  for  which  validities  reached 
useful  levels. 

Numerous  extraneous  variables  can  effect  the  measurement  of 
the  validity  of  occupational  tests.  Determination  of  ASVAB 
validities  is  complicated  by  the  effects  of  range  restriction  in 
both  the  lower  and  upper  score  ranges.  Applicants  in  the  low  end 
of  the  scale  are  eliminated  from  consideration,  and  thus  from  the 
validation  sample.  Likewise,  those  whose  scores  fall  above 
certain  cutoff  scores  may  be  selected  for  certain  exclusive 
career  fields,  thereby  also  contributing  to  a  restriction  in 
range.  Corrections  for  range  restriction  may  be  applied,  and 
will  be  discussed  in  conjunction  with  meta-analytlc  procedures. 

Additional  variables  which  may  effect  ASVAB  validity  data 
are  the  differences  among  training  course  difficulty,  course 


content,  and  parading  systems.  Courses  can  vary  from  weeks  to 
months  in  length,  and  from  basic  to  highly  technical  skills. 
Impacting  on  these  variables  is  the  number  of  students  per 
course.  Some  longer,  more  technical  courses  require  a  lengthy 
time  period  to  develop  an  adequate-sized  sample  for  validation 
purposes.  Also,  variations  in  grading  systems  exist,  such  as 
pas.'^/fail  systems  versus  letter  grades,  which  may  impact  on 
validity  measures. 

Personnel  Selection 

Employee  selection  procedures,  including  recruiting, 
interviewing,  testing,  and  validating  selection  criteria  cost 
organizations  billions  of  dollars  per  year.  Choice  of  selection 
methods,  and  validation  of  those  methods  are  thus  of  vital 
concern  to  employers.  In  addition  to  ensuring  selection  of 
quality  personnel,  employers  must  ensure  their  selection 
procedures  are  within  the  requirements  of  federal  guidelines 
(Uniform  Guidelines  on  Employee  Selection  Procedures,  1978). 

The  guidelines  are  provided  with  the  intention  of  promoting 
equal  employment  opportunities  among  all  persons,  but 
specif ically  for  certain  protected  group.s.  The  guidelines 
require  validation  studies  for  each  occupation  for  which  a 
selection  instrument  is  used,  if  tliat  instrument  adversely 
effects  firing  of  the  specified  proteclod  groups. 
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While  attempfiriR  to  avoid  adverse  impact  in  hiring,  the 
guidelines  urge  employers  to  seek  selection  procedures 
alternative  to  tests,  but  that  are  equally  as  valid.  That 
standardized  tests  are  valid  methods  of  personnel  selection  has 
been  sufficiently  supported  by  Ghlselll  (196f),  1973),  wlio 
reviewed  hundreds  of  criterion-related  validity  studios,  lie 
concluded  that  '  for  every  job,  there  la  at  least  one  ty[)e  of  Lest 
which  has  at  least  moderate  validity"  (Ghiselli,  1973,  p  A77-478). 
Attempts  to  validate  alternative  selection  methods,  however,  have 
not  met  with  such  success.  Reilly  and  Chao  (1982)  reviewed 
research  on  the  validities  of  eight  categories  ol  selection 
methofls:  1.  biographical  data  (biodata),  2.  intei views, 

3.  peer  evaluation,  A.  self-assessment,  5.  reference  checks, 

6.  academic  performance  measures,  7.  expci  t  Judgement,  atid 
8,  projective  techniques.  They  concluded; 

Only  biodata  and  peer  evaluation  were  supporteil  ns  having, 
validllics  substantially  equal  to  those  for  standardized 
tests  ....  data,  wlierc  avallublo,  oilered  no  clear 
indication  lliat  any  of  Llie  allernnliveH  m<!l  the  rrlletJon 
of  having  equal  validity  with  Jesa  adverse  impact,  (p  1) 
Hunter  and  Hunter  (19H4)  nssessed  validities  ol  vat  Jons 
predictors,  witli  training,  peilormance  delermlned  thrnug.h 
supervisor  rating, s  (N  ranged  Irom  1  ,  7H9  to  32,12A).  AltlllLy 
comt)OslleH  provided  I  lie  lilgheai  /ivemg.e  validity,  rtf  .53, 
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followed  by  job  tryouts  at  .Uk,  and  biodata  at  .37.  The 
remolninp,  eipht  predictors  rnnp.ed  in  validity  from  -.01  to  .26. 

The  Uniform  Guidelines'  test  vnlidatlon  rerpn roment.s  are 
based  on  the  Ijclicf  that  a  te.st  whicli  has  demonstrated  validity 
in  one  situation  or  proufi  will  not  fiecessnr  I  ly  have  ncreptable 
validity  in  other.s.  This  exemplifies  t  lie  tlieory  of  situational 
sjirM:  1  flej  ly,  which  Is  the  contention  tliat  althotip,)i  joh.s  may 
nppear  very  similar,  there  are  suhllo,  yet  Important  differences 
In  jobs,  tank.s,  indlvIdualH,  or  Identllinble  nubp, roups  of 
Individuals  wlilcli  tnoderato  the  iiredirtor-crl  ter  ion  relationship. 
For  example,  a  lest  found  to  .ulrruiately  predict  performance  of 
Cblcapo  policemen  mlj;lu  ,  duo  to  racial  or  p/toprajihic  differences, 
or  other  unknown  and  unspecl flnhlo  differences,  bo  invalid  for 
Son  Francisco  policemen. 

Prevalence  of  the  theory  of  situational  specificity  resulted 
in  numerous  validation  studios  for  sinalJ  p, roups,  and  for  nearly 
identical  settiiip.s.  The  validity  p.enera  1 1  zal.  1  oi’  work  of  Sclunidl 
ntid  llunler  oiid  their  nssoci !iI(?h  Iiju*  shown  this  theory  to  be 
qiK’Ml ) onab le  (.Sclpnldl  ^  Hunter,  1977;  .Sclimldt,  Hunter, 

I’earlmari,  6  .Sliaii.',  1979;  .Sebmidt  ,  Hunter,  ft  lirry,  1976).  Tboy 
contend  that,  observed  dinetencea  lii  validities  across  similar 
situiitlons  and  jrjbs  are  almost  entirely  due  to  statist  Ic.sl 
nrlllaeiM.  'Ils’lr  results  have  <-on,'ii slenl Jy  liidle/iU'd  that  lest 
vall<lltles  are  hip, lily  t  rsnsfiorlable  across  proiifis  and  siluatlotis. 
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Societal  consequences  of  the  Importance  of  intelligence 
(or  g)  and  the  ability  to  measure  it  have  definite  implications 
regarding  personnel  selection  procedures.  The  re.search  has  shovm 
that  a  g  factor  exists,  that  it  is  relevant  to  job  performance, 
and  that  it  can  be  assessed  by  use  of  standardized  tests.  What, 
then,  are  the  societal  implications  of  utilizing  tests  as 
selection  techniques?  Two  major  theories,  Functionalist  and 
Revisionist,  have  argued  that  differences  in  intelligence  are  of 
little  or  no  importance  in  the  work  pi ace. 

The  Functionalist  theorists  assume  that  education  provides 
job-relevant  knowledge  recpiired  for  diverse  occupations.  They 
believe  also  that  differences  in  job  performance  result  primarily 
from  specific  skills  that  have  been  learned,  not  from  a  general 
ability,  and  that  access  to  education  is  not  equally  obtainable 
for  all  individuals.  Functionalist  reforms  would  focus  on 
revising  educational  policies  by  providing  more  equal  access  for 
underprivelaged  groups  (^ottfredson,  1986). 

Revisionist  beliefs  are  similar,  yet  are  more  extreme.  They 
argue  that  an  occupational  hierarchy  is  unnecessary,  and  promotes 
unjustifiable  differences  in  rewards.  According  to  Gottfredson 
(1986): 

The  revisionist  perspective  argues,  moreover,  that  the  high 
differentiation  and  specialization  of  work  activities  in 
our  society,  as  well  a.s  t!ie  accompanying  large  differences 


in  the  ent'nnce  requirements  end  socioeconomic  rewards 
amonp,  occupations,  are  tlie  result  of  capitalists 
Hitentionally  frapmentinp,  and  "de-skilling"  work  in  order 
to  increase  not  the  efficiency  of  work,  but  their  control 
over  workers,  (p  382) 

The  soJution  to  the  problem,  as  the  Revisionists  see  it,  is  to 
restructure  tlie  jobs  themselves,  or  the  personnel  selection 
procedures,  rather  than  revise  education  or  training.  They 
assume  the  outcome  to  be  f  .icreased  equality  and  productivity. 

These  theorists  fail  to  consider  the  most  basic  concern  of 
the  hiring  organizations,  namely,  the  need  for  obtaining 
qualified  personnel.  It  is  extremely  difficult  for  employers  to 
be  able  to  predict  each  employee's  exact  v;orth.  During  the 
hiring  procedure,  employers  can  only  assume,  at  best,  a  level  of 
productivity,  or  worth,  for  specific  individuals.  Through 
trial-and-error  procedures,  organizations  have  come  to  identify 
indicators  of  applicants'  potentials,  educational  credentials 
have  come  to  be  an  extensively  used  indicator  of  success  mai-ily 
due  to  their  moderately  high  correlation  with  intelligence 
measures  (r  =  .60)  (Gottfredson,  1986). 

Attempts  at  social  revisions  with  the  dual  goals  of 
increasing  botli  equality  and  productivity  have  met  with  little 
success.  A  trade-off  between  the  two  is  inevitable.  Attempts  to 
increase  equality  by  randomly  assigning  workers  to  differentially 
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j>-loadcd  jobs  would  result  In  decreased  productivity.  Conversely, 
increasing  the  validity  of  job-person  matching,  though  perceived 
fair  ai  '  equitable,  would  further  exaggerate  the  pr<=sent 
occupational  hierarchy  (Gottfredson,  1986). 

A  popular  criticism  of  .selection  testing  is  that  it 
promulgates  racial  and  ethnic  inequalities,  by  identifying  as 
more  qualified  the  individuals  v;ho  are  white,  or  are  from  middle 
class  or  higher  socioeconomic  backgrounds.  Major  supporters  of 
this  belief  liave  been  Ralph  Nader  and  Allen  Nairn  (Nairn,  1980). 
Nader's  criticism  of  the  testing  industry  focused  primarily  on 
the  Scholastic  Aptitude  Test  (SAT),  a  college-entrance  exam 
produced  by  the  Educational  Testing  .Service  (ETS).  Racial  and 
ethnic  criticisms  against  the  use  of  standardized  g-loaded  tests 
have  simply  not  held  up  technically.  Research  on  this  question 
ha.s  consistently  revealed  that  ethnic  groups'  score  differences 
on  ability  tests  represent  true,  consistent  differences  between 
the  group.s.  .Spearman  (1927)  reported  marked  differences  in  mean 
scores  of  blacks  and  whites  on  a  battery  of  10  higlily  g-loaded 
tc.sts.  Jensen  (1985)  sustained  and  corroborated  Spearman's 
findings,  and  reported  that  no  studies  had  been  f(.  ad  to 
contradict  them.  The  diverging  distributions  of  g  among  races 
and  g,enders  will  likely  result  in  a  proportionately  lower  number 
of  blacks,  hispanics,  and  women  selected  than  whites  and  men, 
via  cognitive  ability  measures. 
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That  test  results  may  produce  ethnic  disparity  in  numbers 
hired  does  not  lessen  the  strength  of  the  evidence  for  g,  nor  can 
it  weaken  the  validity  of  the  tests.  The  theory  of  differential 
vallidity,  which  contends  that  validities  are  different  for 
various  groups  of  people,  has  been  repeatedly  disconfirmed 
(Bartlett,  Bobko,  Hosier,  &  Hannon,  1978;  Hunter,  Schmidt,  & 
Hunter,  1979;  Schmidt,  Pearlman,  &  Hunter,  1980).  The  model  of 
test  fairness  adopted  in  the  Uniform  Guidelines  on  Employee 
Selection  Procedures  (1978)  is  the  regression  model,  which 
defines  a  test  as  unfair  if  it  predicts  lower  levels  of  job 
performance  for  a  minority  group  than  that  group  actually 
achieves.  As  stated  by  Schmidt  and  Hunter  (1981)  "The 
accumulated  evidence  on  this  theory  is  clear:  Lower  test  scores 
among  minorities  are  accompanied  by  lower  job  performance, 
exactly  as  in  the  case  of  the  majority"  (p  1131). 

We  have  seen  evidence  for  the  diverging  distributions  of 
test  scores  of  general  cognitive  ability  measures  for  blacks 
versus  whites.  Can  we  expect  similar  results  for  men  versus 
women? 

Literature  reviews  (Jensen,  1980;  Maccoby,  1966;  Maccoby 
&  Jacklin,  197A)  have  identified  three  areas  in  which  gender 
differences  appear:  verbal,  quantitative,  and  spatial  abilities. 

Girls  appear  to  verbally  outperform  boys  beginning  at  very 
early  ages,  perhaps  even  with  their  first  words  (McCarthy,  195A). 


The  differences  are  small,  and  boys  begin  to  catch  up  until  about 
age  10  or  11.  After  puberty  however,  "girls  average  close  to  a 
quarter  of  a  standard  deviation  higher  than  boys  on  verbal  tasks 
.  .  .  .  The  sex  difference  in  verbal  ability  after  puberty 
appears  to  be  a  genuine  phenomenon  and  not  just  a  measurement 
artifact"  (Jensen,  1980,  p  625).  Jensen's  review  of  studies 
published  since  1966  showed  that,  of  58  studies  investigating 
general  intelligence,  15  significantly  favored  females,  whereas 
only  3  favored  males.  This  is  to  be  expected,  in  light  of  the 
knowledge  that  many  general  intelligence  tests  are  heavily 
w'iighted  on  verbal  ability,  and  require  reading. 

The  pattern  of  quantitative  ability  is  exactly  the  opposite 
of  verbal  ability.  Boys  are  clearly  superior  as  adolescents,  and 
this  difference  remains  throughout  life.  Differences  of  one- 
fifth  to  two-thirds  of  a  standard  deviation  have  Teen  observed  by 
the  end  of  high  school.  This  is  again  a  true  difference,  and  not 
an  artifact  of  test  bias.  After  equating  males  and  females  on 
the  number  of  courses  taken,  males  still  emerge  superior, 
suggesting  that  the  difference  is  not  due  to  greater  training  for 
males  {Maccobv  .5  Jack!  in,  197A). 

The  third  area  of  observed  differences,  spatial  ability,  has 
been  difficult  to  define  and  investigate.  Smith's  (1964) 
definition  was  "the  perception,  retention,  and  recognition  (or 
reproduction)  of  a  figure  or  pattern  in  its  correct  proportions” 
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(p  96),  Numerous  skills  have  been  proposed  as  relative  to 
spatis)  ability,  Including:  auditory  localization,  size- 
distance  constancy,  tactual  recognition  of  objects  as  they 
change  in  space,  and  matching  pairs  of  items  that  are  mirror- 
reversals  or  that  have  been  rotated  in  space,  bespite  the 
difficulties  encountered  iji  clarifying  this  concept,  researchers 
have  consistently  Indicated  that  males  are  superior  on  spatial 
abilities,  particularly  after  adolescence.  Maccoby's  (1966) 
review  showed  an  advantage  for  boys  on  the  DAT  and  the  Primary 
Mental  Abilities  Test.  Jensen  (1980)  stated  that  only  about 
one-fourth  of  the  females  exceed  the  male  mean  on  spatial- 
visualization  tests.  The  reason  for  this  difference  is  still 
unclear.  Proposed  causal  factors  include  maturation  rates, 
hoimonal  factors,  sex-linkage,  and  cultural  effects. 

In  general,  females  perform  better  than  males  on  verbal 
tasks,  and  somewhat  better  than  males  on  general  intelligence 
tests,  while  males  are  superior  on  quantitative  and  spatial 
tasks. 

As  it)  the  case  of  racial  differences,  the  gender  differences 
discovered  do  not  adversely  eftert  interpretations  of  test 
vaJidlties.  Differential  predictive  validity  of  tests  by  sex 
ti/iM  coiicent  rated  on  |)rediction  ol  college  grades.  In  Jensen's 
(1980)  review,  he  located  no  studies  reporting  lower  validities 
for  wom(*n.  For  the  ASVAIl,  as  prcviotisly  discussed,  no 
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sij’nif.icant  differences  were  observed  between  validity  data  of 
males  and  females. 

Another  consideration  regarding  personnel  selection  is  the 
cost.  As  stated  previously,  billions  of  dollars  are  spent  for 
hiring  programs  each  year.  According  to  Hunter  and  Schmidt 
(1982),  most  major  corporations  abandoned  the  use  of  selection 
tests  during  the  previous  10  years,  in  order  to  comply  more 
fully  with  federal  guidelines.  During  this  same  period,  the 
rate  of  growth  in  productivity  declined  from  three  and  a  half 
percent  per  year,  to  zero  and  even  below.  The  authors  attributed 
this  decline,  at  least  in  part,  to  decreased  test  usage.  After 
dramatically  lowering  their  testing  standards  to  require  only 
minimum  scores  (at  approximately  a  seventh  grade  level),  IJ.S. 
Steel's  detailed  training  records  showed  that: 

1.  Scores  on  mastery  tests  given  during  training  declined 
markedly ; 

2.  the  flunk-out  and  drop-out  rates  increased  dramatically; 

j.  average  training  time  ajid  training  costs  for  those  who 

did  make  it  through  the  program  increased  substantially, 
and 

it.  average  ratings  of  later  performance  on  the  job  declined. 
(Hunter  &  Schmidt,  1982,  p  298). 

Schmidt,  Hunter,  Outerbrldge,  &  Trnttner  (1986)  have 
empirically  estimated  the  economic  impact  of  various  sclectloti 
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methods  for  the  federal  work  force.  They  determined  that,  for 
the  federal  government,  a  one-year  cohort  based  on  valid, 
cognitive  ability  measures,  would  increase  productivity  values 
up  to  six  hundred  million  dollars  for  each  year  the  employees  are 
retained.  Hunter  and  Schmidt  (1983)  estimated  a  labor  cost 
savings  of  15  to  20  percent  for  any  organization,  due  solely  to 
selection  based  on  cognitive  ability. 

Without  doubt,  the  legal,  societal,  and  financial  aspects  of 
personnel  selection  must  be  of  vital  concern  to  any  organization. 
According  to  John  Hawk  (1986),  employees  who  are  selected  on  the 
basis  of  validity  generalization  (based  on  the  concept  of  g)  are 
more  pro  ictive,  learn  faster,  are  more  quality  conscious,  and 
are  more  satisfied  with  their  jobs.  This  results  in  lower 
turnover  rates.  Ability  tests  remain  the  most  valid  personnel 
selection  procedure  known.  The  future  of  personnel  selection 
procedures  appears  to  lie  in  the  direction  of  improving  testing 
programs,  rather  than  in  the  search  for  alternatives  to  testing. 

Validity  Generalization  and  Meta-analysis 

Schmidt  and  Hunter  and  their  associates  have  extensively 
investigated  the  feasibility  of  transporting  test  validities 
across  groups  and  situations.  Such  validity  generalization 
research  counters  the  claim  of  Llie  situational  specificity 
hypothesis.  By  generalizing,  or  "borrowing"  validities, 


researchers  have  shown  that  nwuerous,  costly  validation  studies 
are  unnecessary. 

The  concept  of  generalizing  results  across  studies  or 
situations  is  not  new.  For  many  years  scientists  have  been 
combining  results  from  numerous  independent  studies  in  order  to 
infer  a  general  conclusion.  Integrative  reviews  have  employed 
various  methods,  including  vote  counting  (counting  of  positive 
and  negative  results),  counting  significant  results,  and 
averaging  statistics  across  sttidles.  Ro.senlhal  (1978)  descrilws 
a  number  of  statistical  combination  procedures. 

As  a  vanguard  of  the  validity  generalization  movement. 

Glass  (1976,  1977)  devised  a  statistical  method  he  entitled 
meta-analysis.  His  method  combines  the  results  of  individual 
studies  by  converging  the  results  into  a  common  metric,  coding 
various  characteristics  of  the  studies,  then  using  conventional 
statistics  to  determine  whether  there  is  an  overall  effect. 

.Schmidt  and  Hunter  and  their  associates  have  built  a 
mota-analytic  procedure  based  on  the  ideas  of  Glass,  but 
incorporated  some  modifications.  The  Schmidt-Hunter  tiKKicI, 
utilizing  principles  of  Bayesian  statistics,  improves  on  Glass's 
procedure  by  providing  corrections  for  sources  of  error. 

Schmidt,  Hunter,  Pearlman,  and  Shane  (1979)  identified  seven 
artifactu.nl  sources  of  variance: 

1.  Differences  between  studies  in  criterion  reliability; 
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2.  dlfffirences  between  studies  in  test  reliability; 

3.  differences  between  studies  in  range  restriction; 

k.  sampling  error  (variance  due  to  N  <oo  ); 

5.  differences  between  studies  in  amount  and  kind  of 
criterion  contamination  and  deficiency; 

6.  computational  and  typographical  error,  and 

7.  slight  differences  in  factor  structure  between  tests 
of  a  given  type,  (p  260-261) 

Schmidt,  et  al.,  do  not  correct  for  the  last  three  sources 
of  error,  linving  determined  that  it  is  difficult  to  estimate 
their  frequency  or  magnitude.  Not  correcting  for  these  error 
sources  results  in  conservative  variance  estimates.  They  also 
apply  a  conservative  decision  rule,  which  accounts  for  the  fact 
that  only  four  of  the  seven  sources  of  artifactual  variance  are 
corrected  for.  According  to  their  rule,  they  reject  the 
situational  specificity  hypotliesis  if  75  percent  or  more  of  the 
variance  of  the  validity  coefficients  is  accounted  for  by  the 
four  corrected  artifacl.s.  If  a  large  proportion  of  the  observed 
variability  is  attributable  to  artifacts,  the  conclusion  is  that 
the  true  population  variance  is  negligible,  and  validity 
coefi iclerUs  can  be  generalized  across  situations. 

A  central  concept  in  the  .Schmidt-tiunter  metliodology  is  that 
validation  attcinfit.s  have  historically  relied  on  insufficient 
HnmpJp  sizes,  and  have  thus  led  to  erroneous  conclusions, 
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because  researchers  have  interpreted  sampling  error  as  true 
variance  (Schmidt  &  Hunter,  1978).  Many  supposed  moderators  of 
predictor-criterion  relationships,  including  race,  ethnicity, 
sex,  age,  socioeconomic  status,  leadership  style,  and  geographic 
area,  have  been  identified  through  belief  in  the  "law  of  small 
numbers."  This  law  proposes  that  a  small  random  sample  can  be 
considered  as  representative  of  a  population  as  a  large  random 
sample.  The  error  of  this  proposition  can  be  shown  using  an 
example  of  race  as  a  moderator.  For  validation  studies,  minority 
samples  have  been  generally  smaller  than  for  the  majority.  In 
single-group  studies,  this  produced  a  large  number  of  white- 
significant,  black-nonsignificant  findings.  This  is  another 
example  of  interpreting  error  variance  (due  to  lack  of 
statistical  power)  rather  than  true  variance. 

Sciimidt  and  Hunter  (1978)  stated  that  the  differential 
validity  approach  is  sound,  but  requires  an  extremely  large 
.sample  size  in  each  group  (in  the  hundreds)  to  have  a  .90 
probability  of  detecting  true  differences.  This  is  a  recognition 
that  statistical  power  increases  with  the  square  of  the  sample 
size,  and  that  largo  samples  are  nece.ssary  to  produce  powerful 
statistical  tests.  The  small  sample  size  problem  encountered  in 
single-group  validity  studies  and  differential  validity 
approaches  are  common  among  job-test  validations,  because 
research  must  often  be  based  on  the  number  of  available  workers. 
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The  -  ,'nalytic  approach  combines  the  results  of  numerous 
small-sample  studies,  and  thus  provides  overall  results  based  on 
a  sufficient  sample. 

Applications  of  Schmidt  and  Hunter's  meta-analytic  procedure 
has  shown,  in  numerous  studies,  that  much  of  the  observed 
variation  in  validities  for  similar  job-test  combinations  is 
artifactual.  For  example,  Schmidt,  Hunter,  and  Caplan  (1981) 
investigated  the  transportability  of  validities  of  four  types  of 
cognitive  tests.  They  reported  that  ".stipport  for  general izabl  IJ  ty 
was  substantiated  for  general  mental  ability  and  arithmetic  tests" 
(p  261).  They  found  that  sampling  error  alone  accounted  for  90 
percent  of  all  variance  due  to  artifacts.  Additional  support  is 
given  by  Pearlman,  Schmidt,  and  Hunter  (1980),  Schmidt  and  Hunter 
(198^),  and  Schmidt,  Hunter,  Pearlmati,  and  Shane  (1979). 

The  evidence  shows  the  situational  specificity  hypothesis  to 
be  invalid.  For  a  given  job-test  combination,  the  four  sources 
of  error  corrected  for  by  meta-ana lyt 1 c  procedures  are  capable  of 
producing  as  much  variance  as  is  generally  observed  between 
validation  studies. 

Validity  Generalization  and  the  ASVAB 

It  appears  that  a  general  ability,  or  g,  factor,  servea  as  a 
link  among  validities  of  cognitive  ability  t(?sls.  Though  authors 
disagree  on  the  propor  definitions  of  g  and  Intelligence,  they 
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agree  that  differences  among  individuals  in  a  general  ability 
factor  are  largely  responsible  for  differences  in  success  and 
status  in  the  United  States  (Tyler,  1986). 

Though  general  cognitive  ability  is  seen  as  a  valuable 
predictor  of  performance,  with  some  validity  for  all  jobs,  there 
are  jobs  for  which  its  validity  is  relatively  low.  Data 
available  from  U.S.  Employment  Service  (USES)  validation  studies 
on  515  jobs  representative  of  those  in  the  Dictionary  of 
Occupational  Titles  (DOT)  point  toward  job  complexity  ns  a 
moderator  of  validities  (Hunter,  Crosson,  &  Friedman,  1985). 

Even  in  the  worst  cases,  however.  Hunter  concluded  an  average 
cognitive  ability  validity  of  .37  for  training  success  and  .32 
for  job  proficiency.  Additionally,  Hunter  et  al.  (1985) 
analyzed  existing  Army  data  (Helme,  Gibson,  &  Brogden,  1957), 
and  found  indications  of  higher  validity  for  cognitive  ability 
on  complex  tasks  (such  as  decision  making),  accompanied  by  higher 
validities  for  psychomotor  ability  measures  on  tasks  of  low 
complexity  (sucli  as  following  instructions,  as  in  "feeding”  and 
"off-bearing"  tasks.  Feeding  refers  to  putting  something  into  a 
machine;  off-bearing  is  removing  something  from  a  machine.) 

Thus  as  job  complexity  varies,  higher  predictive  validities  may 
he  obtained  by  shifting  between  cognitive  and  psychomotor 
abilities  as  the  primary  predictors. 

If  cognitive  ability  is  such  a  pervasive  concept,  what  then. 


is  its  relationship  to  the  ASVAB,  which  was  developed  to  reflect 
differential  aptitudes’  The  theory  of  differential  aptitudes 
would  propose  that  ASVAB  subtest  composite  (AI)  validities  will 
be  high  for  jobs  in  corresponding  occupational  areas,  and  lower 
for  unrelated  occupations.  For  specific  jobs,  poor 
predictability  of  one  aptitude  could  thus  be  offset  by 
emphasizing  another  aptitude  with  higher  validity.  Hunter, 
et  al.  (1985)  disbelieved  this  theory,  and  subsequently  showed 
it  to  be  untrue.  They  analyzed  ASVAB  scores  from  samples  of 
military  personnel  (N  ranged  from  107  to  13,90A)  according  to  the 
subtest  composites  used  by  high  schools  for  occupational 
counseling.  Composite  content  was  as  follows:  Verbal  = 

WK  +  PC  +  GS;  Quantitative  =  AR  +  MK;  Technical  =  AS  ♦  MC  +  El ; 
Speed  =  NO  +  CS.  Each  occupational  composite  was  found  to  be 
nearly  as  valid  for  other  occupational  areas  as  it  was  for  its 
own  occupations,  A  clear  lack  of  differential  validity  was 
observed. 

In  an  additional  analysis.  Hunter  et  al,  (1985)  analyzed 
an  Air  Force  sample  of  29,619  recruits,  with  AI  validities 
computed  for  70  jobs  (Wilbourr,  Valentine,  &  Ree,  198A),  After 
the  data  had  been  corrected  for  restriction  in  range.  Hunter 
and  his  associates  concluded  that  only  in  the  electronics  area 
was  the  AI  more  valid  for  its  corresponding  jobs  than  for  any  of 
the  other  composites.  In  the  same  study,  mechanical  and  general 
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occupational  peit'ormances  were  predicted  better  by  composites 
other  than  the  ones  designed  specifically  for  these  career 
fields.  Predictive  ability  of  a  General  Cognitive  Ability 
composite  was  found  to  be  as  high  as  those  for  Administrative 
and  Electronic  composites,  and  higher  than  those  for  the  other 
two  composites  (Mechanical  an<l  General). 

Viewing  such  results  as  these,  we  can  see  a  strong 
indication  of  a  General  Cognitive  Ability  factor  moderating  the 
relationships  among  ASVAB  validities.  According  to  Jensen 
(1986): 

Virtually  no  one  today  disputes  that  a  g  factor  can  be 
extracted  from  the  correlations  among  any  large  and  diverse 
collection  of  menta.T  ability  tests,  and  tliat  the  g  factor  is 
usually  substantiated  in  the  sense  of  subsuming  a  relatively 
large  proportion  of  the  total  variance  in  all  of  the  tests 
as  compared  with  other  factors  bcside.s  g.  The  point  that  is 
being  questioned  is  whether  the  g  lactor  represents  any 
reality  outside  the  operations  of  psychometric  tests  and 
factor  analysis,  (p  302) 

Extensive  evidence  does  support  the  existence  of  the  g 
phenomenon  beyond  psychometric  operations.  Non-psychometric 
correlates  of  g  have  been  identified  in  recent  year.s.  g  loadings 
have  been  found  related  to  heritabiiity  (Block,  1968;  Tambs, 
Sundet,  &  Magnus,  190A),  and  to  lest  correlations  among  family 
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members  (Nagoshi  &  Johnson,  1986).  A  review  by  Jensen  (1983) 
identified  12  studies  providing  evidence  that  g  loadings  are 
related  to  inbreeding  depression.  Other  reviews  (Eysenk  & 
Barrett,  1985;  Haier,  Robinson,  Braden,  A  Williams,  1983)  have 
led  to  the  conclusion  that  g  correlates  with  evoked  electrical 
brain  potentials. 

Jensen  (1986)  went  on  to  describe  a  hierarchical  factor 
structure  wherein  variance  unique  to  individual  tests  is  filtered 
out  at  the  level  of  primary  factors,  variance  unique  to  primary 
factors  is  filtered  out  at  the  level  of  secondary  factors, 
etcetera,  with  the  g  factor  appearing  at  the  highest  level.  He 
states : 

The  g  factor  is  remarkably  stable  across  different 
collections  of  mental  tests,  even  collections  of  tests  that 
bear  hardly  any  superficial  resemblance  to  one  another  .  . 

,  .  all  so-called  intelligence  tests,  or  "IQ"  tests,  even 
when  they  have  not  been  constructed  with  reference  to  factor 
analysis,  are  found  to  be  very  highly  g  loaded,  (p  308  and 
310) 

Though  the  ASVAB  was  not  constructed  via  factor  analytic 
methods,  researchers  have  applied  such  procedures  to  the  test  to 
determine  its  factor  structure.  These  analyse, s  have  indicated 
tliat  the  r(?gres.sion-base(l  composites  do  not  corres|>ond  to 
factor-analytic  composites  (Hunter,  1983b;  U..S.  Naval  Personnel 
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Research  Activity,  1981). 

In  Hunter's  investigation  (1983b)  he  produced  a  hierarchical 
factor  model  of  military  subtests  through  analysis  of  available 
factor-analyzed  data  (Maier  &  Grafton,  1981;  Sims  &  Hiatt,  1981; 
Thorndike,  1957).  He  identified  three  factors  common  to  all  the 
data:  Quantitative,  Verbal,  and  Technical  composites.  The 
factor  model  identified  by  Hunter  (see  figure  1)  exemplifies  the 
hierarchical  structure  described  by  Jensen  (1986).  Because 
Hunter's  ('xtersive  .study  clearly  describes  the  relationships 
among  ASVAB  subtests,  it  will  be  discussed  in  some  detail. 

Hunter  (1983b)  investigated  additional  data  (Kass,  Mitchell, 
Grafton,  &  Wing,  1982),  based  on  ASVAB  forms  8,  9,  and  10,  using 
confirmatory  factor  analysis  (N  =  98,689).  Only  6  of  the  10 
siibtosts  were  used.  One  subtest,  AS,  did  not  load  significantly 
on  any  factor.  The  inclusion  of  AS  in  the  Technical  factor  with 
K1  and  MC  would  have  generated  inconsistent  data.  For  this 
reason,  AS  v;as  omitted  from  further  analysis.  Hunter  found  NO 
and  CS  (the  two  speeded  subtests)  to  correlate  highly  with  each 
other,  yet  NO  consistently  correlated  higher  with  other  subtests 
lluin  did  CS.  He  found  that,  in  the  presence  of  good  verbal  and 
(|unntltative  measures,  NO  did  not  raise  validity  meaningfully. 
Also,  NO  and  C.S  did  not  appear  to  measure  the  same  construct  as 
l’erce()tu/  1  Speed  subtests  of  other  batteries,  thus  these  subtests 
were  excluded  for  tlic  reasons  S|)ecified.  The  last  exclusion  was 


Figure  1 

Hunter's  Hierarchical  Factor  Model 
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PC.  Though  very  similar  in  composition  to  WK,  PC  had  a 
reliability  of  about  .75,  while  the  reliability  of  WK  was  found 
to  be  .90.  Weighting  schemes  for  combining  the  two  by  giving 
more  weight  to  WK  did  not  improve  reliability,  so  PC  was  dropped. 
Hunter  stressed  that  these  four  subtests  were  not  excluded  for 
reasons  of  invalidity,  but  for  the  purpose  of  allowing 
construction  of  composites  that  are  comparable  with  civilian 
test  battery  data. 

Hunter  found  that  the  remaining  six  subtests  produced  three 
highly  intercorrelated  factors,  designated  as  Quantitative 
.'otitvde  (AR  +  UK),  Verba.;  A{.  .i«,ude  (WK  +  GS),  and  Technical 
Aptitude  (El  +  MC).  His  analy;  •.  s  showed  that  validity  is  not 
higher  when  analyses  are  performed  with  all  ten  subtests.  Table 
2  presents  the  ASVAB  composite  structure,  and  the  factor 
structure  isolated  by  Hunter.  He  concluded  that  "factor-analytic 
composites  are  more  valid  than  the  multiple-regression  composites 
for  predicting  job  performance"  (Hunter,  198A,  p  72). 

Through  a  subsequent  confirmatory  factor  analysis,  Hunter 
identified  a  second-order  factor  which  he  labeled  General 
Cognitive  Ability.  The  best  estimate  of  this  general  factor  was 
defined  as  the  sum  of  the  three  aptitude  composites  (Verbal  + 
Quantitative  +  Technical),  or  as  the  sum  of  the  six  underlying 
subtests.  Table  2  shows  the  composition  of  General  Cognitive 
Ability. 
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Table  2 

ASVAB  Composite  and  Factor  Structures 


General  Cognitive  Ability 


Speed^  Quant^  ^  Verbal^  ^  Tech^  ^ 


AS^  PC** 

NO^  CS** 

AR  MK 

WK  GS 

El  MC 

M 

X 

X 

1 

X 

A 

X 

X  X 

X 

G 

X 

X 

X 

E 

X  X 

X 

X 

AFQT 

X 

X 

X 

X 

Indicates  subtests  excluded  from  Hunter's  model. 

^  Indicates  the  four  factors  Initially  derived  through 
exploratory  factor  analysis. 

Indicates  the  three  factors  Isolated  by  confirmatory  factor 
analysis. 
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In  yet  another  confirmatory  factor  analysis  from  the  same 
study,  Hunter  analyzed  the  data  available  from  Sims  and  Hiatt 
(1981)  and  Maier  and  Grafton  (1981),  both  derived  from  ASVAB 
forms  8,  9,  and  10,  and  he  also  included  data  from  Thorndike 
(1957),  based  on  the  now-obsolete  Army  AC-IB.  The  Maier  and 
Grafton,  and  Sims  and  Hiatt  data  produced  a  similar  pattern  (see 
figure  2)  but  the  Thorndike  data  produced  two  differences 
relative  to  the  model.  An  additional  factor.  Perceptual 
Aptitude,  was  obtained  consisting  of  now-obsolete  subtests, 

Also,  two  of  the  AC-IB  primary  factor.s  contain  subtest 
configuration.s  different  from  the  other  data  (due  to  test 
revisions  over  the  years),  thus  these  subtest  validities  are  not 
shown  in  figure  2. 

That  all  the  data  sources  produced  a  similar  pattern  of 
aptitudes  provides  strong  support  for  existence  of  the  second- 
order  factor,  General  Cognitive  Ability.  For  example,  results 
from  the  Kass,  et  al.  (1982)  data  indicate  that  the  correlations 
between  the  aptitude  factors  differ  by  no  more  than  sampling 
error  (see  Table  3).  Hunter's  measurement  model  provides 
evidence  that  the  individual  ASVAB  subtests  contribute  to  overall 
validity  via  their  contributions  to  one  of  the  three  jirimary 
aptitudes.  The  validities  of  the  primary  aptitudes,  in  turn, 
are  not  useful  due  to  independent  contrj but  ions,  but  to  their 
inclusion  in  the  second-order  factor,  Getreral  Cognitive  Ability. 


Figure  2 

Path  Model  Relative  to  Military  Composite  Data 
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Sims  &  Hiatt  (1981)  data. 
Maier  &  Grafton  (1981)  data. 
Tiiorndikc  (1957)  data. 
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Table  3 


Intercorrelations  of  Factors 


Note.  Data  as  analyzed  by  Hunter,  et  al.  (1985),  obtained  from 
Kass,  et  al.  (1982).  (N  =  98,689) 

^  Q  (Quantitative  Aptitude)  =  AR  +  MK 
^  V  (Verbal  Aptitude)  =  WK  +  GS 

^  T  (Technical  Aptitude)  =  El  +  MC 

^  G  (General  Cognitive  Ability) 
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The  general  factor  provides  a  measure  of  job  performance  due  to 
its  relationship  to  job  knowledge  acquisition. 

General  Cognitive  Ability  was  seen  to  predict  job  training 
performance  in  specific  areas  as  well  as  or  better  than  the 
regression-based  composite  of  specific  abilities  designed 
expressly  for  that  occupational  area.  Only  one  exception  was 
observed.  Perceptual  Speed  made  an  incremental  contribution  to 
validity  over  General  Cognitive  Ability  in  predicting  performance 
for  clerical  occupations  (Hunter,  1984),  Hunter  presumed  this 
was  due  to  effects  of  the  speeded  ASVAB  subtest,  CS.  His  meta¬ 
analysis  of  eight  military  studies  (1985)  verified  the  evidence 
for  differential  validity  in  business  and  clerical  occupations 
due  to  the  Perceptual  Speed  factor.  The  validity  of  Speed  for 
clerical  work  was  .43,  ver.sus  .37  for  other  types  of  work. 

Hunter  identified  a  general  Perceptual  Aptitude  factor  as 
improving  validities  beyond  the  influence  of  General  Cognitive 
Ability  relative  to  Thorndike's  (1957)  data.  Current  versions  of 
the  ASVAB  however,  do  not  contain  Perceptual  Aptitude  measures. 

Hunter's  extensive  work  on  military  data  has  shown  the 
ASVAB  composites  to  be  of  acceptable  validity,  though  in  light  of 
General  Cognitive  Ability  as  a  moderating  factor,  perhaps  the 
compo.sitcs  are  of  questionable  utility. 

Tliough  Hunter  and  Sclimidt's  validity  generalization 
procedure  has  been  .shown  to  be  a  viable  procedure,  it  has  not 
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been  without  critics.  Forty  questions  regarding  meta-analysis 
and  its  applications  to  validity  generalization  are  presented  in 
a  debate  format  (Sackett,  Schmitt,  Tenopyr,  Kehoe,  &  Zedeck, 

1985;  Schmidt,  Hunter,  Pearlman,  &  Hirsch,  1985).  For  23  of  the 
questions,  Schmidt,  et  al.'s  responses  stand  as  acceptable, 
Schmidt  (1988)  comments; 

The  commentators  ....  took  no  issue  with  the  major 
practical  conclusion  of  meta-analytic  research  in  personnel 
selection:  that  validities,  particularly  of  cognitive 

tests,  have  been  shown  to  be  widely  generalizable  across 
settings,  jobs,  populations,  organizations,  geographical 
areas,  time  periods,  etc.  Most  of  the  commentary  was  in  the 
nature  of  a»  '^empts  to  "fine  tune"  statistical  methods  or 
applications,  (p  179) 

Practical  acceptance  of  meta-analysis  is  evident  from  the 
V7ide  application  it  has  been  given,  including  topics  outside  the 
realm  of  personnel  selection.  Examples  include:  correlates  of 
role  conflict  and  role  ambiguity  (Fisher  f-  Gittelson,  1983), 
evaluation  of  Fiedler's  theory  of  leadership  (Premack  &  Wanous, 
1985),  and  ability  of  financial  analysts  to  predict  stock  growth 
(Coggin  &  Hunter,  1983). 

The  USES  and  the  federal  government  are  currently  the  two 
largest  users  of  validity  generalization.  The  USES  has  adopted 
validity  generalization  as  the  basis  for  its  testing  program 


that  operates  through  state  employment  services.  Civilian  users 
Include  the  American  Petroleum  Institute,  Sears- Roebuck,  and 
various  insurance  and  utilities  industries  (Schmidt,  1988). 

Validity  generalization  concepts  have  been  incorporated  in 
psychological  texts  (Anastasl,  1982),  and  In  the  Standards  for 
Educational  and  Psychological  Testing  (APA,  1974i).  The  Uniform 
Guidelines  on  Employee  Selection  Procedures  issued  in  1978 
(Uniform  Guidelines,  1978)  are  more  accepting  of  validity 
generalization  than  the  previous  (1970)  guidelines,  and  "spell 
out  more  precisely  tl)e  methods  necessary  to  "borrow  validity" 
or  generalize  validity  results  to  similar  Jobs"  (Baker  & 

Terpstra,  1982,  p  603).  Baker  and  Terpstrn  specify  two  court 
cases  in  which  generalized  validities  have  Ixjen  accepted  as 
evidence  to  refute  racial  and  sex  discrimination  charges:  Friend 
V.  Leidinger,  1977,  and  Pegucs  v.  NlsBissippi  Sla^  Epifilojiine 
Service,  1980.  It  is  thus  not  unreasonable  to  expect  that 
future  revisions  of  the  guidelines  will  Include  an  even  broader 
acceptance  of  validity  generalization  procedures. 

Statement  of  Purpose 

This  study  Investigated  charncli’r Isl Ics  ol  ASVAU  validity 
data  via  Schmidt  and  Hunter's  metn-analyi ic  prot cdui es.  In  light 
of  the  research  reviewed,  It  follows  that  (Jencral  Oo/’.iil I  Ive 
Ability  moderates  the  relationship  between  Job  knowledge  and 


A3 


job  performance.  Also,  verbal  and  numerical  skills  constitute 
the  major  components  of  General  Cognitive  Ability.  The  AFQT, 
containing  two  verbal  subtests  (WK  and  PC),  plus  two  numerical 
subtests  (AR  and  NO) ,  may  be  considered  to  represent  a  general 
cognitive  measure. 

Hunter's  work  in  particular  (1983a,  1983b,  198A,  1985)  has 
demonstrated  higher  predictive  validities  for  General  Cognitive 
Ability  than  for  the  regression-based  ASVAB  selector  composites. 
This  study  extends  Hunter's  work  by  attempting  to  demonstrate 
that  the  AFQT,  as  a  general  cognitive  measure  intrinsic  to  the 
ASVAB,  is  a  valid  predictor  of  Air  Force  technical  training 
course  success.  In  line  with  Hunter's  results,  the  general 
cognitive  measure  (AFQT)  was  expected  to  improve  prediction 
beyond  that  given  by  the  four  selector  AIs  (Mechanical, 
Administrative,  General,  and  Electronic).  Furthermore,  it  was 
expected  that  the  improvement  in  prediction  would  be  greater  for 
women  than  for  men.  Females  have  been  shown  to  score  somewhat 
higher  than  men  on  cognitive  ability  measures,  thus  the  females' 
validities  were  expected  to  raise  when  the  AFQT  alone  was  used 
as  L[ie  [^reurcLui.  xiiO  AFQT  iS  not  based  on  specralj-zed 
experience  to  the  extent  of  the  other  composites.  For  example, 
general  cognitive  measures  (as  with  the  AFQT)  lack  mechanically- 
related  items,  on  which  men  tend  to  score  higher  than  women. 


The  formal  hypotheses  to  be  tested  are; 

1-  Ho:  /afqT  "  ^ 

^1'  ^APQT  ^  ^  occupations. 


The  validity  of  the  AFQT  for  predicting  Air  Force  technical 
training  success  is  essentially  zero,  for  all  job/training-course 
combinations. 


2.  Ho:  /afqi;  "  /m 

^1  •  'l^AFQT  ^  M 


for  Mechanical  occupations. 


AFQT  validity  will  be  essentially  equal  to  (will  not  exceed) 
validity  of  the  M  composite  for  predicting  training  success  in 
mechanical  occupations. 

3.  Ho:  / AFQT  "  /a 

for  Administrative  occupations. 


/afqt  ^ 


AFQT  validity  will  be  essentially  equal  to  (will  not  exceed) 
validity  of  the  A  composite  for  predicting  training  success  in 
administrative  occupations. 
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/afqt  '  /g 

^4'  7^ AFQT  ^  T^G  General  occupations. 

afqt  validity  will  be  essentially  equal  to  (will  not  exceed) 
validity  of  the  G  composite  for  predicting  training  success  in 
general  occupations. 

5-  ^  "  /e 

for  Electronic  occupations. 

AFQT  validity  will  be  essentially  equal  to  (will  not  exceed) 
validity  of  the  E  composite  for  predicting  training  success  in 
electronic  occupations. 

6.  Ho;  /aFQT.F  '  /aI.F  "  /aFQT.M  ‘  fkl.W 

/afqt.f  “  /ai.f  ^  /afqt.m  "  /ai.m 

Any  incremental  validity  observed  due  to  the  AFQT  will  be 
equal  for  females  and  males,  for  all  occupations. 
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CHAPTER  II 
METHODS 


The  Instrument 

Current  ASVAB  versions  (forms  8  through  14)  consist  of  33A 
items  in  10  subtests.  The  ASVAB  is  group  administered  by  trained 
DoD  personnel  in  Military  Entrance  Processing  Stations  and  their 
satellites  (about  1,000  locations),  and  is  also  administered  at 
approximately  14,000  high  schools  nation  wide.  The  volume  of 
tests  given  is  approximately  2.3  million  per  year. 

ASVAB  subject  areas  were  chosen  according  to  their  validity 
for  predicting  training  success  in  each  of  the  military  services. 
The  content  of  subtests  of  forms  8  through  14  is  shown  in  Table 
4.  Total  testing  time  is  144  minutes. 

Subjects 

Data  were  obtained  from  an  AFHRI.  validation  atudy  ferlorroed 
by  Wilbourn,  Valentine,  and  Ree  (1984).  The  .subjects  for  their 
research  consisted  of  29,619  males  and  females,  aged  17  to  24.  A 
description  of  these  subjects  is  given  in  Table  5.  The  subjects 
were  all  first-term  Air  Force  enlistees,  most  of  whom  were  high 
school  graduates.  They  were  tested  on  ASVAB  form.s  8,  9,  and  10 
between  October,  1980,  and  March,  1982. 

The  subject  group  was  restricted  in  range  at  the  low  score 
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Table  A 

ASVAB  Subtest  Content 


Subteat 

Ccneral  Science 
(GS) 

Arltiwnctic  RcaaoniriR 
(AK) 


Content 

general  science,  including 
biology  and  physics 

arithmetic  word  problems 


Word  Knowledge  °  selecting  synonyms 

(WK) 


Pnrngrnph  Comprehension 
(I*C1 


ability  to  understand  a 
written  text 


Nllinerlciil  Opcrntlons 
(NO) 


simple,  arithmetic 
calculations 


Coding  .S|)eijd 


I) 


(CS) 


H»ib8tl luting  numeric  codes 
lor  verbal  material 


Number  of 
Questions 

25 


30 


35 


15 


50 


8A 


(table  continues) 


Subtest 

Content 

Number  of 

Questions 

Auto  and  Shop  Information 

automobile  and  tool-usape 

25 

(AS) 

knowledge 

Mathematics  Knowledge 

calculations  including 

25 

(MK) 

algebra,  geometry,  and 

elementary  trigonometry 

Electronics  Information 

electrical  principles 

20 

(El) 

and  terminology 

a 


b 


AFQT  oubtests. 
Speeded  subtests. 
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Tablt  $ 

Deicription  oC  Subjccti  by  Gender  and  Ethnicity 


N 


Willie  mnlcH 

21,544 

While  fctmtlcs 

2,702 

Black  males 

4,040 

Black  females 

590 

Other  races 

733 

Totals 

29,619 

Total  white 

24,256 

Total  black 

4,630 

Other  races 

733 

Totals 

29,619 

Total  males 

26,259 

Total  females 

3,360 

Totals 

29,619 

rercent 

72.8 
9.1 

13.6 

2.0 

100.0 

81.9 

15.6 

-JA 

100.0 

88.7 
11.3 

100.0 
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levels  to  the  extent  that  only  those  passing  certain  cutoff 
scores  gained  enlistment.  (Selection  criteria  included  the  AFQT 
score,  the  sum  of  the  four  AIs,  and  the  General  AI  score).  An 
additional  restriction  factor  curtailed  the  technical  school 
samples  at  the  upper  score  levels.  Because  each  technical  school 
maintains  a  cutoff  score  on  an  appropriate  selector  AI,  higher¬ 
scoring  subjects  were  assigned  to  schools  with  higher  eligibility 
requirements. 

The  Criterion 

The  validation  criterion  used  in  the  study  was  technical 
training  course  final  grades.  The  courses  included  were  the  70 
courses  which  had  at  least  100  graduates.  Data  were  computed  on 
the  subgroups  which  had  at  least  25  individuals.  Technical 
school  grades  generally  range  between  70  (passing)  and  100. 
Approximately  four  percent  of  enlistees  do  not  pass  the  training 
courses.  Attrition  thus  further  restricted  the  sample  range.  As 
previously  discussed,  training  school  success  is  a  highly 
appropriate  criterion,  as  it  directly  predicts  job  performance 
(Hunter,  198/4 ). 

Procedures 

Meta-analytic  procedures  were  used  to  combine  validities 
given  in  the  Wilbourn,  et  al.  data,  across  the  four  AIs  and  the 
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AFQT,  for  the  entire  group,  and  separately  for  males  and  females. 

Meta-analytic  procedures  of  Schmidt  and  Hunter  utilize 
corrected  correlations,  however,  no  procedure  is  presently 
available  for  estimating  the  standard  errors  of  corrected 
correlations.  For  this  reason,  the  original  data  wore  combined 
twice;  first,  as  uncorrected  raw  data;  secondly,  as  corrected  for 
range  restriction  and  unreliability.  Confidence  intervals  were 
then  built  around  the  uncorrected  validities,  and  corrected  data 
were  applied  to  adjust  the  endpoints  of  the  Intervals  (Hunter, 
Schmidt,  &  Jackson,  1982).  The  corrected  confidence  intervals 
were  compared  to  determine  overall  effects,  lletailed  procedural 
steps  were  as  follows: 

For  the  corrected  group: 

1.  correct  validity  coefficients  for  range  reutrictlon; 

2.  correct  values  obtained  in  step  1  for  unreliability; 

For  both  corrected  and  uncorrected  groupo: 

3.  transform  validities  to  Fisher's  z  values; 

A.  weight  and  combine  validities  (z’s)  for  total,  male,  nnd 
female  groups,  within  each  Job  category  (M,  A,  G,  at»d  K) 

5.  estimate  mean  and  standard  deviation  for  each  group  from 
step  A ; 

6.  convert  z  values  to  r's; 

For  computing  confidence  interval n: 

7.  compute  confidence  Intervolo  around  uiicuK  e'Jt'd 
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validities,  correcting  the  endpoints  for  range 
restriction  and  unreliability; 

8.  compare  confidence  Intervals  to  determine  whether  they 
overlap. 

Reliability  measures  were  obtained  from  a  recent  AFHRL 
study  (Palmer,  Hartke,  Ree,  Welsh,  &  Valentine,  1988).  The 
correction  calculations  were  obtained  from  Hunter,  et  al.  (1982), 
and  are  given  in  Appendix  A.  Fisher's  z  transformation  was  used 
to  convert  the  data  into  a  linear  scale,  z  values  were  obtained 
from  a  transformation  table  (Agrestl  &  Finlay,  1986). 

The  z  transformation  was  used  in  this  study  despite  the  fact 
that'  Hunter  and  Schmidt  prefer  not  to  utilize  z's.  Hunter,  et 
al.  (1982)  state  that  the  z  transformation  produces  positive  bias 
in  obtained  correlations.  As  evidence,  they  cite  a  validity 
generalization  study  for  computer  programmers  (Schmidt,  Gast- 
Rosenberg,  &  Plunter,  1980),  in  which  average  validities  were 
larger  by  about  .07  due  to  the  z  transformation.  They  add, 
however , 

The  fact  that  the  Fisher  z  transformation  leads  to  bias  in 
averaging  correlations  does  not  mean  that  there  is  any 
problem  with  the  transformations  in  the  context  for  which  it 
was  invented,  i.e.,  confidence  intervals.  The  Fisher  z 
transformation  expands  large  correlations  relative  to  small 
ones,  which  causes  the  confidence  interval  around  large 
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correlations  to  be  smaller  than  those  around  small 
correlations.  However,  the  confidence  intervals  around 
large  correlations  should  be  smaller  because  sampling  error 
is  smaller.  Thus  the  expansion  has  the  desired  effect  on 
confidence  intervals,  (p  42) 

Final  results  were  returned  to  r  values  for  interpretation. 

Weighting  procedures  were  used  in  step  4  for  combining  the 
validities,  and  in  step  7  for  computation  of  confidence  intervals 
because  the  stability  of  a  correlation  is  solely  dependent  upon 
the  sample  size.  Weighting  validities  by  the  Ns  of  the  samples 
gave  more  weight  to  correlations  based  on  larger  samples. 
Weighting  the  correlations  was  particularly  important  in  order  to 
allow  gender  comparisons.  The  Ns  of  female  samples  are  typically 
smaller  than  those  of  male  samples  (see  Appendix  B  for  sample 
data) . 

Correcting  for  range  restriction  required  three  values: 

1.  standard  deviations  of  the  predictor  variable  (ASVAB  scores), 
which  represents  the  restricted  sample;  2.  the  correlations  of 
the  predictor  with  the  criterion  (final  school  grades);  and 
3.  the  standard  deviation  of  the  unrestricted  sample.  This 
value  (28.29)  represents  the  standard  deviation  of  any  complete 
percentile  scale  (Netter,  Wasserman,  &  Whitmore,  1988). 

Reliabilities  corrected  included  only  those  of  the 
predictor.  As  is  often  the  case,  the  criterion  reliabilities 
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are  difficult  to  estimate  (Hunter,  et  al.,  1982).  The  corrected 
values  are,  as  a  result,  more  conservative  than  if  both  predictor 
and  criterion  reliabilities  were  corrected.  Additional 
uncorrectable  factors,  such  as  variations  in  technical  school 
lengths,  grading  systems,  and  numbers  of  students,  lend 
conservatism  to  the  corrected  values. 

Analyses  of  the  corrected  data  were  performed  via  computer 
analysis,  utilizing  SPSS’^  procedures  (SPSS  Inc.,  1986). 

For  the  final  step,  evaluating  the  data  for  overall  effects, 
95  percent  confidence  Intervals  were  formed  around  obtained 
values.  Based  on  the  methods  of  Hunter,  et  al.  (1982), 
confidence  intervals  avoid  problems  of  accumulating  Type  I  and 
Type  II  errors.  The  endpoints  of  the  Intervals  were  corrected 
according  to  the  formulae  described  in  Appendix  A.  Overlapping 
confidence  intervals  were  taken  as  indications  that  the  values 
did  not  differ.  The  reader  should  be  aware  that  the  procedure  of 
correcting  the  endpoints  of  confidence  intervals  can  create 
nonsyjr/netrlcal  intervals. 
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chapter  III 
RESULTS 


Tabled  data  are  presented  to  four  decimal  places,  for  two 
reasons.  First,  all  calculations  were  made  at  this  level  of 
precision.  Secondly,  rounding  to  two  decimal  places,  as  most 
validity  data  are  present  <  lit  the  literature,  obscured  small 
differences  which  were  consi. jred  important. 

Table  6  presents  the  validities  (mean  r)  as  combined  and 
corrected  through  meta-analysis.  Also  shown  are  the  95-percent 
confidence  intervals  computed  around  each  validity  coefficient. 
Comparisons  were  made  of  validities  resulting  from  prediction 
by  composites,  versus  validities  resulting  from  prediction  by  the 
AFQT  for  total,  male,  and  female  groups  within  each  composite 
category  (M,  A,  G,  and  E).  Comparisons  which  identified 
significantly  (p  ^  .05)  different  validities  (95  percent 
confidence  intervals  did  not  overlap)  are  identified,  with  the 
direction  of  the  difference  indicated  by  "<"  or  ">". 

In  Table  7,  the  proportion  of  variance  accounted  for  by  each 

mean  corrected  validity  coefficient  is  presented.  The  values  in 

the  difference  column  (r*  -  r*)  indicate  the  amount  of  change  in 

c  a 

prediction  as  a  result  of  utilizing  each  alternative  (composite 
or  AFQT).  Alternatives  with  significantly  higher  validities  are 
Indicated  by  the  "<"  or  ">"  comparison. 
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Table  6 

Corrected  Vallditleg  and  Confidence  Intervals  for  Comparigon 
Condltiona 


ComfWBlte 


AFxrr 


Mean  Mean 

_i_.  93X  Cl  r 


Total 

,8330 

.8189 

to 

.8477 

m 

.8115 

.7981 

to 

,8207 

M 

Males 

.8475 

.8314 

to 

.8616 

m 

.8349 

.8214 

to 

.8503 

Females 

.8305 

.6509 

to 

.9105 

m 

.8103 

.7134 

to 

.0702 

Total 

.4325 

.3701 

1.0 

.4859 

< 

,0561 

.8331 

to 

.0766 

A 

Males 

.5035 

.4359  to 

.5632 

> 

.8689 

.8392 

to 

.8882 

Females 

.2552 

.1046 

to 

.3651 

< 

.8655 

.8067 

lo 

.9018 

Total 

.9142 

.9030 

to 

.9202 

> 

.8515 

.8416 

to 

.8660 

G 

Males 

.9220 

.9035 

to 

.9227 

> 

.8561 

.8409 

to 

,867'^ 

Females 

.9510 

.9124 

to 

.9643 

> 

.8896 

,8282 

to 

,9044 

Total 

.9239 

.9(X)2 

lo 

.9218 

> 

.8384 

.8279 

to 

.8071 

E 

Males 

.9268 

.9241 

to 

.9410 

> 

.8589 

.8422 

lo 

.8805 

Females 

.9674 

.7872 

to 

.9887 

■ 

.7855 

.6451 

lo 

.9113 
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Table  7 


HatliMtea  of  Proportion  of  Variance  Accounted  for  by  the 


veraua  the 


r*  r*  r»  -  r» 


Total 

Cowpoaitc 

.6939 

.8 

AFQT 

.6585 

r'  -  r' 
c  a 

.0359'’ 

M 

Halos 

.7183 

a 

•r 

.6970 

.0212“ 

Pemnlea 

.6897 

a 

■ 

.6566 

.0331® 

Total 

.1870 

< 

.7329 

-.5909 

A 

Halos 

.2535 

< 

.  7550 

-.5015 

Fomalua 

.0651 

< 

.7991 

-.6990 

Total 

.8358 

> 

.7250 

.1108 

0 

Hales 

.6501 

> 

.7329 

.1172 

Females 

.9099 

> 

.7919 

.1130 

Total 

.8536 

> 

.  7029 

.1327 

V. 

Hales 

.8589 

> 

.7377 

.1212 

Fem/iles 

.9359 

a 

• 

.6170 

.3189® 

**  Confldonco  inlervala  indicate  no  nl^'tiif leant  difference 
|)fllwoeti  the  cofn|ioBJlc  vermis  the  Af'QT  for  these  groups, 
lliufi  l|pe  r*  dlllerenres  mny  Im?  Intorprelod  as  zero. 


h 


IM  1 1  eieni’e  liel.wei'ii  r*  composite  iind  r*  AKQT, 


58 


Null  hypothesis  number  one  was  rejected.  The  alternate 
hypothesis  was  supported,  which  proposed  that  the  AFQT  would 
demonstrate  validity  for  predicting  training  success  for  all 
occupations.  AFQT  validity  was  supported  for  all  job  groups, 
and  Independently  for  male  and  female  subgroups.  AFQT  validities 
range  from  approximately  .79  to  .89,  thus  the  AFQT  accounts  for 
61  to  79  percent  of  the  criterion  variance. 

Null  hypothesis  number  two  was  not  rejected.  The  alternate 
hypothesis  proposed  that  the  AFQT  validity  would  be  significantly 
higher  than  the  validity  of  the  Mechanical  (M)  composite  for 
predicting  training  success  for  M  jobs,  but  this  was  not 
supported.  Analyses  of  the  confidence  intervals  indicate  equal 
validities  for  the  M  composite  and  the  AFQT. 

Null  hypothesis  three  was  rejected.  Alternate  hypothesis 
three,  that  the  AFQT  would  have  higher  validity  than  the 
Administrative  (A)  composite,  is  clearly  supported  by  the  data. 
The  AFQT  accounts  for  about  73  to  75  percent  of  the  variance 
(about  50  to  69  percent  more  than  the  A  composite).  Inspection 
of  Table  6  shows  substantial  Increases  in  validity  due  to  the 
AFQT,  but  particularly  for  females. 

According  to  null  hypotheses  four  and  five,  the  General  (G) 
and  Electronic  (E)  composites  should  have  validities  equal  to  the 
AFQT  validity  for  predicting  training  success  for  G  and  E  jobs 
respectively.  The  null  hypotheses  were  not  rejected  in  these 
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cases.  Higher  validities  are  indicated  for  the  G  and  E 
composites  than  for  the  AFQT  in  all  cases  but  one.  That  is,  the 
data  indicate  composite  and  AFQT  validities  are  equal  for 
predicting  training  success  only  for  E  jobs,  and  only  for  the 
female  group.  For  all  other  G  and  E  conditions,  the  composites 
algniflcantly  ou'-nerform  the  AFQT,  accounting  for  about  11  to  13 
percent  more  variance  than  the  AFQT  (see  Table  /). 

According  to  alternate  hypothesis  six,  any  incremental 
validity  seen  due  to  the  AFQT  rather  than  the  specific  composites 
would  be  greater  for  females  than  males.  Only  in  the  A  job 
categories  were  validities  higher  for  the  AFQT,  and  in  these 
cases,  females  did  show  a  significantly  larger  increase  than 
males.  For  females,  the  increase  (the  difference  between  AFQT 
and  A  r-square  values)  was  .69A0.  For  males,  the  increase  was 
.5015.  A  comparison  of  the  confidence  intervals  (see  Table  6) 
shows  that  the  composite  confidence  interval  for  females  lies 
below  that  for  males,  with  no  overlap.  The  conclusion  drawn  is 
that  the  increase  in  validity  is  clearly  greater  for  females  than 


for  males. 
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CHAPTER  IV 
DISCUSSION 

For  predicting  success  in  the  mechanical  jobs,  M  and  AFQT 
validities  are  not  significantly  different,  statistically 
speaking.  However,  had  the  mean  validities  for  M  been 
significantly  higher,  the  incremental  validity  provided  by  M 
would  range  from  .0212  to  .035A  (r*  values).  These  values 
would  provide  seemingly  small  advantages.  With  large  samples, 
however,  even  small  differences  can  be  important.  Small  amounts 
of  predictive  validity  can  be  advantageous  when  applied  toward 
the  classification  of  a  large  number  of  individuals.  Practically 
speaking,  then,  these  differences  would  support  the  continued  use 
of  the  M  composite  for  predicting  success  in  M-type  jobs.  The 
AFQT  (or  any  other  composite)  should  not  be  substituted  for  M 
unless  clearly  shown  to  be  a  superior  predictor  over  the  M 
composite.  Substantial  differential  validity  favors  the  G  and  E 
composites  for  predicting  success  in  general  and  electronic 
occupations,  and  favors  the  AFQT  for  predicting  success  in 
administrative  occupations. 

It  is  appropriate  at  this  point  to  compare  these  results  to 
those  found  by  Hunter.  In  Hunter's  (1983b)  analysis,  he  included 
a  technical  composite,  consisting  of  El  plus  MC,  as  part  of  his 
measure  of  General  Cognitive  Ability  (see  figure  1).  He  stated 
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that  "a  measure  of  Technical  Aptitude  ....  augments  the 
measure  of  General  Cognitive  Ability"  (Hunter,  et  al.,  1985, 
p  viii).  In  the  present  study,  the  AFQT  has  been  cast  in  the 
role  of  a  measure  of  General  Cognitive  Ability.  However,  the 
AFQT  contains  no  Technical  Aptitude  subtests.  In  addition, 
Hunter's  General  Cognitive  Ability  composite  excluded  speeded 
subtests,  but  the  AFQT  contains  a  half-weighted  value  of  NO. 
Speeded  tests  such  as  NO  commonly  have  low  validity  (Hunter, 
et  ai.,  1985;  Thorndike,  1986).  These  differences  between  the 
measures  of  General  Cognitive  Ability  used  in  the  two  studies 
could  help  to  explain  why  Hunter's  analysis  showed  less 
differential  validity  than  is  observed  in  the  present  study. 

Another  difference  between  the  two  studies  is  the  number  of 
subtests  contained  in  the  General  Cognitive  Ability  measure. 
Hunter  included  six  reliable  subtests,  but  this  study  used  only  . 
three  subtests  with  equally  acceptable  reliability  (reliability 
of  NO  is  low  in  comparison  with  other  subtests).  Because 
reliability  of  measurement  increases  as  the  number  of  reliable 
components  increases,  error  of  measurement  may  be  less  for 
Hunter's  study. 

The  AFQT  was  chosen  to  represent  General  Cognitive  Ability 
for  this  meta-analysis  for  two  reasons.  First,  its  verbal  and 
mathematic  subtests  (respectively,  WK  and  PC;  AR  and  NO)  are 
representative  of  the  major  areas  on  which  most  batteries 
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measuring  General  Cognitive  Ability  concentrate.  (Many 
educational  batteries,  for  example,  rely  on  only  verbal  and 
quantitative  aptitude  measures  [Hunter,  et  al.,  1985)). 

Secondly,  because  AFQT  scores  are  currently  computed  for  ASVAB 
examinees,  transition  from  using  a  less  valid  composite  to  the 
AFQT  could  be  accomplished  with  relative  ease. 

Recent  re.search  (Jones,  1988)  indicates  that  the  AFQT  may 
not,  in  fact,  be  the  best  measure  of  g,  or  General  Cognitive 
Ability  obtainable  via  the  ASVAB  subtests.  Jones  determined  the 
g  loadings  of  the  10  ASVAB  subtests  through  a  principle 
components  analysis,  then  ranked  the  subtests'  g  loadings,  and 
determined  the  Spearman  rank-difference  correlation  coefficient. 
Her  data  base  for  the  principle  components  analysis  was  the  1980 
Profile  of  American  Youth  (N  =  approximately  12,000).  Data  for 
the  correlational  computation  were  chosen  from  an  Air  Force 
accessions  validity  study  (Wilbourn,  Valentine,  &  Ree,  198A). 
(Total  N  for  Jones'  study  =  2A,A82-,  N  of  her  samples  ranged  from 
107  to  A, 673).  She  found  that  the  subtests'  g  loadings  are 
positively  correlated  with  their  average  corrected  validity 
coefficients  (Spearman's  rho  =  .75).  Jones'  validity  data,  and 
the  data  used  in  tliis  meta-analysis  are  from  the  same  source.  As 
would  be  expected,  the  results  are  rather  consistent,  regardless 
of  the  analysis  performed. 

Table  8  presents  the  .sublests  and  their  g  loading  ranks  as 
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determined  by  Jones,  as  well  as  the  subtest  configurations  of  the 
composites.  A  number  of  points  may  be  made  relative  to  these 
results. 

First,  it  becomes  clear  that  a  better  measure  of  g  than  the 
AFQT  configuration  is  available.  The  AFQT  subtests  WK  and  AR 
appear  highly  g  loaded,  as  would  be  expected.  However,  PC  and  NO 
could  be  replaced  with  GS  and  MK,  to  utilize  the  four  most  highly 
ranked  g-loaded  subtests.  Secondly,  both  speeded  subtests  of 
the  ASVAB  (NO  and  CS)  rank  comparatively  low  in  terms  of  g 
(eighth  and  tenth).  That  these  subtests  have  quite  low  g  ranks, 
as  well  as  low  validities  merits  their  exclusion  from  a  composite 
Intended  to  measure  General  Cognitive  Ability. 

Jones'  results  are  helpful  in  explaining  the  results  of  the 
present  study.  Most  striking  is  the  comparison  of  the  g-loadlng 
ranks  of  the  E  composite  versus  the  AFQT.  The  median  g-loadlng 
rank  of  the  composite  is  approximately  3,  compared  to  a  median 
rank  of  the  AFQT  of  about  5.  All  E-composite  subtests,  Including 
the  specifically  electronic  subtest  El,  rank  quite  high  in  terms 
of  g,  and  in  fact,  appear  to  measure  g  better  than  the  AFQT.  The 
basic  premise  of  the  present  study,  that  highly  g  loaded 
composites  (or,  good  measures  of  General  Cognitive  Ability) 
should  have  higlier  predictive  validities  than  specialized 
composites,  is  supported  by  Jones'  results  (see  Table  b).  If  the 
E  composite  were,  in  fact,  a  better  measure  of  g  than  the  AFQT, 
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Table  8 

R-loading  Rankings  of  ASVAB  Subtests 
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X 
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X® 
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the  results  obtained  through  the  present  meta-analysls  would  be 
expected,  because  the  meta-analysls  shows  that  E  demonstrates 
higher  validities  than  the  AFQT. 

This  conclusion  leads  to  a  consideration  of  the  G  composite 
as  compared  to  the  AFQT,  in  terms  of  g  loadings.  As  can  be 
observed  in  Table  8,  the  G  composite  is  actually  a  subset  of  the 
AFQT,  differing  only  as  to  its  exclusion  of  NO.  Adding 
components  to  a  measuring  instrument  will  Improve  measurement 
only  if  the  new  components  are  reliable  and  valid.  Addltloti  of 
NO  to  the  G-composite  subteots,  resulting  In  the  AFQ'l 
configuration,  would  not  be  expected  to  improve  measurement 
because  NO  is  insufficiently  reliable  compared  to  the  other 
subtests.  If  inclusion  of  NO  decreases  overall  validity  of  the 
AFQT,  we  would  expect  the  present  results  to  reflect  lower 
validities  for  the  AFQT  than  for  the  G  composite,  This  Is,  In 
fact,  what  is  observed.  AFQT  volldltles  range  from  ,8515  to 
.8896  (mean  r  values)  for  the  general  orcu|mllons.  According  l(» 
the  analyses,  these  values  are  significantly  lower  than  (#- 
composite  vnllditlcs  (mean  r  ranges  from  .9J9Z  to  .9510),  The 
G  composite  accounts  for  approximately  II  percent  more  vat  lance 
tlian  the  AF(|f  (see  Table  7). 

A  final  observation  relalea  !  (»  the  A  coflipoHlIe,  Tlii*  j; 
loading  rankings  and  mnla-niialyt Ic  results  again  oppeat  to 
coincide.  Mela-fllialysls  sliows  the  AFQT  to  be  a  rnoru  valid 
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predictor  than  the  A  composite.  The  median  g-loading  rank  of  the 
composite  is  about  6,  compared  to  about  5  for  the  AFQT.  In 
comparison,  both  composites  include  WK  and  PC.  The  A  composite, 
however,  contains  two  speeded  subtests,  both  of  comparatively 
low  g  loading  and  validity.  Three  of  the  four  A  composittj 
subtests  rank  among  the  lower  half,  in  terms  of  g.  Additionally, 
the  AFQT  contains  AR,  a  fairly  highly  g-loaded  subtest.  These 
differences  indicate  that  the  AFQT  is  more  highly  g  loaded  than 
A,  and  therefore  could  be  expected,  as  Jones'  results  indicate, 
to  predict  better  than  the  A  composite. 

Speeded  subtests  have  been  considered  problematic, 
particularly  due  to  their  characteristics  of  low  validity 
(Thorndike,  1986)  and  low  reliability  (Wegner  &  Ree,  1986). 

Wegner  and  Ree  (1986)  Investigated  the  effects  of  the  speeded 
subtest,  NO,  upon  the  validity  of  the  AFQT.  Specifically,  they 
attempted  to  identify  ASVAB  subtest  composites  alternative  to 
the  present  AFQT  that  would  contain  no  speeded  subtests,  and 
that  would  produce  validities  at  least  as  high  as  the  current 
AFQT.  Utilizing  ASVAB  scores  of  15A,788  Air  Force  recruits,  with 
technical  training  course  grades  as  the  validation  criterion, 
they  assessed  the  validities  of  15  AFQT  alternatives.  All  of  the 
alternatives  produced  uncorrected  validities  higher  than  that  of 
the  current  AFQT.  The  researchers  proposed  two  possible  reasons 
for  this  finding.  First,  the  results  could  be  due  to  effects  of 
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the  speeded  subtest  in  the  current  AFQT.  Secondly,  validity 
coefficients  may  have  been  reduced  due  to  curtailment  of  the 
sample  as  a  result  of  initial  selection  on  the  basis  of  the 
General  composite.  It  should  be  noted  again  that  the  G  composite 
is  a  subset  of  the  AFQT,  and  also,  when  analyzed  as  one  of  the 
15  AFQT  alternatives,  the  G  composite  produced  the  second-lowest 
validity  coefficient. 

Wegner  and  Ree  identified  three  of  the  alternative  AFQTs  as 
being  acceptable  alternatives  to  the  current  configuration.  One 
of  these  has  been  chosen  to  replace  the  current  AFQT  as  of 
January  1,  1989.  The  replacement  AFQT  is  composed  of 
AR  +  MK  +  2VE,  where  VE  =  WK  +  PC. 

Table  9  shows  the  current  and  replacement  AFQT  compositions 
in  relation  to  the  g-loading  ranks  of  their  subtests.  The 
replacement  AFQT  demonstrates  higher  validity  (Wegner  &  Ree, 

1986)  and  in  relation  to  Jones’  results,  higher  g  loading  than 
the  current  AFQT.  In  addition,  the  replacement  produces  minimal 
change  in  the  enlistment  qualifications  of  specific  subgroups 
(females,  blacks,  and  minorities). 

A  discrepancy  is  noted  between  the  present  meta-analytic 
results  and  those  of  Hunter,  et  al.  (1985).  Their  results,  based 
on  numerous  data  sets,  indicated  that  a  Speed  factor  may 
contribute  something  to  validity  for  clerical  occupations  above 
what  is  contributed  by  General  Cognitive  Ability.  Their  analysis 


68 


Table  9 

R-loading  Rankings  of  Current  and  Replacement  AFQTs 
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utilizing  occupational  composites  reported  in  the  DoD  Student 
Testing  Program  (often  called  the  High  School  ASVAB)  resulted  in 
a  Business  and  Clerical  composite  validity  of  .59  for 
corresponding  occupations,  and  only  about  .5A  for  other 
occupations.  (The  Business  and  Clerical  composite  was  composed 
of  WK  +  PC  +  CS  +  MK).  Validity  of  General  Cognitive  Ability  was 
only  .56  for  buslne.^s  and  clerical  occupations.  This  meta- 
analysis  indicated  opposite  results.  That  is,  the  general 
cognitive  measure  (AFQT)  had  higher  validity  than  the  clerical 
composite  (A)  for  administrative-type  jobs. 

Additionally,  a  meta-analysis  of  eight  military  studies 
produced  validity  coefficients  of  a  Speed  factor  for  clerical 
work  of  .A3,  versus  .37  for  other  types  of  work  (Hunter,  et  al., 
1985).  (Note  that  these  findings  do  not  contradict  Hunter's 
previously  described  work  based  on  the  data  of  Sims  and  Hiatt 
(1981),  Haier  and  Grafton  (1981),  and  Thorndike  7),  on  which 
he  based  his  path  model.  The  path  model  did  not  utilize  speeded 
subtests  (see  figures  1  and  2)).  The  present  meta-analy tic 
results  suggest  that  speeded  subtests  decrease  validity  in 
general,  but  that  they  may  be  of  even  greater  detriment  to  the 
business/clerical  (administrative)  occupations  than  others. 

The  reason  for  this  discrepancy  regarding  the  effects  of 
speeded  subtests  for  predicting  clerical  work  is  not  clear.  A 
possible  explanation  lies  in  the  differences  among  the  predictive 
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composites  and  occupational  criteria  used  in  the  various  studies. 
The  composite  configurations  used  by  Hunter  and  his  associates 
were  occupational  composites  constructed  to  be  relative  to 
civilian  occupations  (in  reference  to  the  high  school  composites) 
and  were  composites  used  by  various  brnnchea  of  the  aervlce.  The 
present  study,  however,  analysed  only  Air  Force  com|>osltas  and 
occupations.  Furtltcr  study  is  ncccssnry  to  concltjsivejy  verify 
this  assumption. 

Alternate  hypotheals  six  proposed  that  the  AFQT  would 
provide  greater  Iticrcmenlal  vaildify  for  femnles  ihnri  for  m/jles, 
Thia  was  verified  only  for  administrative  occupations.  Also,  the 
administrative  occupational  group  was  the  only  group  lor  which 
the  AFQT  validity  was  significantly  higher  thnn  the  composite 
validity.  If  a  different  general  cognitive  measure  were  to  t)« 
employed,  with  higher  g  loading  than  the  AFQT,  we  miglU  ex|>ecl 
to  see  higher  validities  for  the  general  cognitive  measure  as 
well  as  higher  increments  for  femnloB,  in  ell  occupalloiisl 
groups.  Conlliuied  resonreh  is  ncccssury,  however,  to  Invoaiigole 
possible  explunatlona  for  the  gender  dlffortMices.  Such  research 
might  include  inveotlgatlon  of  the  elfecls  nl  ilie  A  compoollo's 
speeded  subtests. 

Two  possible  explnnolions  nrn  ttroposed  for  the  iHifi'ri'nllfll 
validity  observed  in  this  meln-mm lysis;  1.  'Ilie  dlllerumeH 
accurutcly  reptf^sent  prcdlcLive  oblljtlcf  t»r  the  vm  loos 
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conpeiiltei;  2.  Ai  a  manure,  of  General  Cognitive  Ability,  the 
AFQT  ia  iniudiclenl. 

In  certain  condltlona,  evidence  of  differential  validity  may 
be  expected.  For  example,  Hunter  et  al.  (19B5)  atated; 

....  for  certain  hlfjhly  p.nomctrlc  profcsalona  in 
engineering  and  the  graphic  arts,  Spatial  Aptitude  makes 
a  ennt rll>ut ion  In  Job  performan'jr  bcyoiKl  that  of  General 
Cognitive  Ability  alone.  Thus,  there  ate  some  Jobs  in 
which  |»eople  do  Inilld  on  apecllic  leartilnp,  which  is 
menaured  by  the  a|n’cfflc  apiitti/lea.  (p  92) 

They  specify  however,  that  thii  la  not  the  case  with  Technical 
aptitude;  the  importance  of  a  Technical  aptitude  factor  is  in 
lie  fneroment  to  the  meaaurement  of  General  Copnillvc  Ability. 

'n)e  aptitude  areae  meaeured  by  the  ASVAB  (verbal,  quantitative, 
and  icchlilral)  ere  relatively  p,eneral,  exiallnp,  al  the  level  of 
aeroiKl-ordfr  (artore,  or  group  i actors,  accord Inp,  to  Jensen 
(I9U6),  .Spatial  aptitude  has  not  proven  uselul  an  a  group  factor 
or  as  a  component  In  group  factors,  for  predict ior»  of  military 
Irainlnp,  coiirso  success.  In  fact,  a  .Spatial  l’ercf;»llon  suhtest 
(HI'),  previously  contained  in  llie  A.SVAH,  waa  dropped  with  ASVAD 
forms  0,  9,  and  10,  Hunter  (l9B3b)  fndfeated  tlint,  fri  the 
prcaciirij  ul  r.ood  Tcclinlca)  nn<l  Qnont  Ifni  f  vc  niooHuros,  SI'  dl«l  not 
raise  validity  meaningfully. 

file  ijonclualoK  for  Ihfo  niula-nnalyfrJii  li  lii  H  on  the  work  of 
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Jones  (1988)  to  support  the  second  explanation  of  the 
differential  validity  observed.  As  major  points:  1.  General 
Cognitive  Ability  as  represented  by  the  AFQT  is  more  valid  as  a 
predictor  than  the  less  g-loaded  Administrative  composite;  2. 

The  Electronic  composite  represents  g  more  strongly  than  the 
AFQT,  thus  resulting  in  higher  validities  for  E;  3.  The  AFQT 
demonstrates  lower  validities  than  the  General  composite,  which 
is  presumably  due  to  the  AFQT's  inclusion  of  the  speeded,  low 
g-ranked  subtest,  NO;  4.  g  loadings  of  the  AFQT  versus  the 
Mechanic  composite  are  not  clearly  determinable  by  visual 
inspection.  However,  results  Indicate  little  differential 
validity  (.0212  to. 0354)  and  these  differences  are  not 
statistically  significant. 

Recommendations 

Predictive  validity  of  the  ASVAB  will  be  improved  by 
replacing  the  A  composite  with  a  more  valid  measure.  The  current 
AFQT  is  more  valid  than  A,  thus  it  would  be  a  useful  replacement, 
and  would  provide  an  easy  transition  because  it  is  currently 
scored.  The  G  composite  may  also  prove  to  be  a  useful 
replacement  for  A.  While  the  AFQT  appears  to  work  equally  well 
for  all  job  categories,  the  G  comnosite  is  more  valid  than  the 
AFQT  for  G-type  jobs  (see  Table  6),  thus  it  may  be  more  useful 
than  the  AFQT  for  A-type  jobs  as  well.  Other  subtest 
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configurations  should  be  investigated,  however,  if  the  goal  is  to 
provide  the  best  predictive  measure  for  administrative 
occupations.  Neither  the  AFQT  nor  the  G  composite  should  be 
assumed  the  best  possible  configuration  available  for  replacing 
the  A  composite. 

The  speeded  subtests,  NO  and  CS,  have  low  validities, 
reliabilities,  and  g  loadings.  Problems  presumed  due  to  its 
speeded  component  have  resulted  in  reconfiguration  of  the  AFQT. 
Further  Investigations  of  the  effects  of  these  subtests  may 
indicate  merit  in  dropping  them  entirely.  As  of  January  1, 

1989,  the  only  Air  Force  composite  to  utilize  NO  and/or  CS  will 
be  the  A  composite.  Re.search  could  be  performed  to  determine 
whether  validity  of  the  A  composite  may  be  improved  simply  by 
dropping  the  problematic  speeded  subtests. 

A  meta-analysis  similar  to  the  present  study  should  be 
performed,  based  on  data  from  the  replacement  AFQT,  when  it 
becomes  available  after  January  1,  1989.  Additionally,  similar 
meta-analyses  would  be  highly  informative  in  four  variations: 

1.  Employ  a  General  Cognitive  Ability  composite  that  is 
more  highly  g  loaded  than  the  current  AFQT,  possibly 
utilizing  the  four  subtests  with  highe.st  g-loading  rani's 
(see  Table  8). 

2.  Employ  a  General  Cognitive  Ability  composite  which 
includes  measures  of  Technical  aptitude,  such  as  the  El 


and  MC  subtesta. 
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3.  Evaluate  effects  of  gender  and  ethnicity  when  El  and  MC 
are  added  to  the  measure  of  General  Cognitive  Ability. 

A.  Evaluate  validity  of  the  G  composite  as  a  General 

Cognitive  Ability  measure,  including  a  comparison  of  the 
effects  of  G  versus  the  current  AFQT,  to  assess  the 
effects  of  the  speeded  AFQT  component,  NO. 
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Formulae  are  taken  from  Hunter,  Schmidt,  &  Jackson  (1982). 
Computations  were  performed  with  sample  estimates  of  the 
population  parameters. 


1. 


Correction  for  restriction  in  range; 

C 

U  =  — — —  Where:  S  =  standard  deviation  of 

®  percentile  scores  for 

the  unrestricted  group 
(population) 


and:  s  =  standard  deviation  of 
the  predictor  (ASVAB 
scores)  given  by 
Wilboiirn  et  al.  (198A) 


(u>  -  U/l  *  1 

Where:  =  reference  population 

'  correlation 


and: 


-  study  (sample) 
'  correlation 


2.  Correction  for  unreliability  of  the  predictor; 


r 


- 

'  r 

Where • 

r  =  validity  coefficient 

xy 

XX 

and : 

r  =  reiinhllity  coefficient 

of  the  predictor 

3.  Estimating  mean  and  variance  (performed  with  r's  as 


represented  in  Fisher's  z  form): 


Mean:  r= 


1  N, 


Corresponding 


variance:  = 

r 


£(  (r^  -  r)*  ) 

£  N, 


k.  Confidence  intervals: 

Step  1:  Compute  confidence  intervals  around  uncorrected 
correlations. 

Step  2:  Apply  formulae  given  in  //I  and  111  to  correct 
endpoints  of  obtaitied  intervals  for  range 
restriction  and  unreliability. 
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Table  B-1 

Raw  Validity  Data 


MECHANICAL  OCCUPATIONS 


Composite 

Total 

Males 

Females 

AFSC 

or 

AFQT 

_N _ 

_r_ 

N 

r 

36130 

C 

127 

.37 

125 

.39 

36130 

A 

127 

.44 

125 

.44 

42331 

C 

361 

.33 

308 

.31 

53 

.20 

42331 

A 

361 

.26 

308 

.29 

53 

.42 

42333 

C 

431 

.41 

363 

.44 

68 

.05 

42333 

A 

431 

.27 

363 

.31 

63 

.22 

42632 

C 

1238 

.46 

1079 

.46 

159 

.19 

42632 

A 

1238 

.42 

1079 

.45 

159 

.33 

42633 

C 

165 

.43 

146 

.  46 

42633 

A 

165 

.42 

146 

.49 

42733 

C 

158 

.31 

140 

.34 

42733 

A 

158 

.27 

140 

.22 

42735 

C 

550 

.27 

489 

.28 

61 

.21 

42735 

A 

550 

.41 

489 

.42 

61 

.31 

47231 

C 

134 

.52 

119 

.57 

47231 

A 

134 

.22 

119 

.27 

(table  continues) 
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AFSC 

Composite 

or 

AFQT 

Total 

Males 

Females 

N 

r 

N 

_r_ 

r 

47232 

C 

135 

.47 

123 

.48 

47232 

A 

135 

.34 

123 

.34 

55130 

C 

151 

.36 

149 

.36 

55130 

A 

151 

.34 

149 

.32 

55230 

\j 

100 

.29 

91 

.21 

55230 

A 

100 

.19 

91 

.23 

55232 

C 

115 

.47 

98 

.44 

55232 

A 

115 

.42 

98 

.44 

56631 

C 

172 

155 

.42 

56631 

A 

172 

.43 

155 

.51 

11430 

C 

111 

.25 

105 

.27 

11430 

A 

111 

.42 

105 

.42 

42731 

C 

322 

.16 

293 

.19 

29 

.35 

42731 

A 

322 

.41 

293 

.39 

29 

.43 

43130 

C 

155 

.46 

150 

.45 

43130 

A 

155 

.37 

150 

.38 

43131 

C 

2179 

.47 

2107 

.47 

72 

.35 

43131 

A 

2179 

.46 

2107 

.47 

72 

.38 

43132 

C 

2216 

.49 

2124 

.50 

92 

.43 

43132 

A 

2216 

.44 

2124 

.46 

92 

.46 

44330 

C 

248 

.47 

242 

.46 

( table  continues) 
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Composite 

Total 

Hales 

Females 

AFSC 

or 

AFQT 

N 

r 

N 

r 

N  r 

44330 

A 

248 

.39 

242 

.37 

46330 

C 

217 

.39 

203 

.41 

46330 

A 

217 

.  64 

203 

.66 

ADMINISTRATIVE  OCCUPATIONS 


60230 

C 

107 

.26 

75 

.27 

32 

.28 

60230 

A 

107 

h6 

75 

.47 

32 

.43 

60231 

C 

107 

.29 

78 

.31 

29 

.19 

60231 

A 

107 

.54 

78 

.53 

29 

.58 

70230 

C 

1841 

.15 

1280 

.20 

561 

.02 

70230 

A 

1841 

.38 

1280 

.20 

561 

.31 

60330 

C 

166 

.41 

111 

.42 

55 

.38 

60530 

A 

166 

.51 

111 

.53 

55 

.47 

20731 

C 

138 

.33 

96 

.39 

42 

.18 

20731 

A 

138 

.  64 

96 

.65 

42 

.55 

29333 

C 

132 

.16 

96 

.22 

36 

.01 

29333 

A 

132 

.30 

96 

.33 

36 

.21 

73230 

C 

679 

.31 

485 

.33 

194 

.25 

73230 

A 

679 

.51 

485 

.53 

194 

.45 

(table  continues) 
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GENERAL  OCCUPATIONS 


AFSC 

Composite 

or 

AFQT 

Total 

Hales 

Females 

N 

r 

N 

r 

N 

r 

57130 

C 

817 

.44 

793 

.44 

57130 

A 

817 

.42 

793 

.42 

62230 

C 

A48 

.38 

307 

.37 

141 

.40 

62230 

A 

AA8 

.35 

307 

.33 

141 

.41 

81130 

C 

A673 

.45 

4673 

.45 

81130 

A 

AG73 

.42 

4673 

.42 

81132 

C 

1855 

.49 

1617 

.49 

81132 

A 

1855 

.46 

1617 

.46 

27630 

C 

248 

.47 

197 

.50 

51 

.34 

27630 

A 

248 

.44 

197 

.48 

51 

.27 

29130 

C 

348 

.32 

228 

.31 

120 

.34 

29130 

A 

348 

.30 

228 

.27 

120 

.37 

51130 

C 

192 

.43 

155 

.41 

37 

.38 

51130 

A 

192 

.42 

155 

.43 

37 

.31 

90230 

C 

608 

.57 

446 

.59 

162 

.51 

90230 

A 

608 

.52 

446 

.53 

162 

.47 

90A30 

C 

121 

.37 

88 

.35 

33 

38 

90A30 

A 

121 

.37 

88 

.37 

33 

.36 

(table  continues) 


Composite 

Total 

Males 

Females 

AFSC 

or 

AFQT 

_N _ 

r 

N 

r 

N 

r 

90630 

C 

2A0 

.38 

177 

.37 

63 

.42 

90630 

A 

240 

.39 

177 

.40 

63 

.36 

91530 

C 

105 

.37 

73 

.43 

32 

.24 

91530 

A 

105 

.38 

73 

.39 

32 

.40 

9K130 

C 

222 

.32 

171 

.32 

51 

.40 

98130 

A 

222 

.32 

171 

.32 

51 

.30 

55330 

C 

130 

.59 

106 

.56 

55330 

A 

130 

.58 

106 

.55 

20230 

C 

135 

.45 

102 

.40 

33 

.57 

20230 

C 

135 

.40 

102 

.33 

33 

.56 

25130 

c 

195 

.34 

142 

.33 

53 

.29 

25130 

A 

195 

.33 

141 

.33 

53 

.29 

6A531 

C 

538 

.35 

419 

.35 

119 

.38 

6^531 

A 

538 

.36 

419 

.36 

119 

.40 

ELECTRONIC 

OCCUPATIONS 

40A31 

C 

142 

.60 

117 

.62 

25 

.48 

A0A31 

A 

142 

.53 

117 

.56 

25 

.35 

A2330 

C 

561 

.55 

488 

.53 

73 

.38 

(table  continues) 
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Composite 

Total 

Males 

Females 

AFSC 

or 

AFQT 

N 

r 

N 

r 

N 

r 

42330 

A 

561 

.47 

488 

.  46 

73 

.35 

30333 

C 

113 

.38 

107 

.37 

30333 

A 

113 

.27 

107 

.27 

44530 

C 

210 

.52 

205 

.50 

44530 

A 

210 

.45 

205 

.43 

30430 

C 

219 

.55 

203 

.56 

30430 

A 

219 

.43 

203 

.46 

30434 

C 

366 

.49 

332 

.48 

34 

.34 

30434 

A 

366 

.31 

332 

.34 

34 

.08 

30534 

C 

237 

.45 

215 

.43 

30534 

A 

237 

.29 

215 

.28 

30630 

C 

144 

.59 

125 

.57 

30630 

A 

144 

.43 

125 

.44 

30632 

C 

138 

.43 

121 

.45 

30632 

A 

138 

.28 

121 

.33 

30730 

C 

180 

.37 

146 

.40 

34 

.14 

30730 

A 

180 

.31 

146 

.34 

34 

.36 

31630 

C 

328 

.45 

309 

.47 

31630 

A 

328 

.24 

309 

.29 

32130 

C 

127 

.41 

117 

.45 

(table  continues) 
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Composite 

Total 

Males 

Females 

AFSC 

or 

AFQT 

N 

r 

N 

r 

N 

r 

32130 

A 

127 

.29 

117 

.32 

32123 

C 

288 

.49 

262 

.48 

26 

.54 

32123 

A 

288 

.43 

262 

.45 

26 

.38 

32232 

C 

Ikk 

.49 

222 

.50 

32232 

A 

Ikk 

.36 

222 

.36 

32A30 

C 

100 

.46 

93 

.47 

32^30 

A 

100 

.35 

93 

.36 

32530 

C 

245 

.41 

236 

.43 

32530 

A 

245 

.26 

236 

.27 

32531 

A 

342 

.40 

314 

.40 

28 

.33 

32531 

A 

342 

.34 

314 

.35 

28 

.35 

32633 

C 

105 

.54 

101 

.54 

32633 

A 

105 

.54 

101 

.56 

3263^ 

C 

188 

.36 

167 

.39 

3263A 

A 

188 

.29 

167 

.27 

32636 

C 

283 

.48 

254 

.53 

29 

.03 

32636 

A 

283 

.31 

254 

.32 

29 

32637 

C 

206 

.41 

19U 

.37 

32637 

A 

206 

.19 

190 

.21 

32638 

C 

290 

.54 

271 

.54 

(table  continues) 


AFSC 

Composite 

or 

AFQT 

Total 

N  r 

32638 

A 

290 

.39 

32830 

C 

351 

.56 

32830 

A 

351 

.36 

32831 

C 

297 

.45 

32831 

A 

297 

.33 

32833 

C 

2kk 

.53 

32833 

A 

Ikk 

.45 

3  28  3 A 

C 

218 

.44 

3283^ 


A 


218  .27 
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Male.s  Females 


N 

r 

N 

r 

271 

.39 

317 

.55 

34 

.50 

317 

.39 

34 

.40 

271 

.47 

26 

.07 

271 

.38 

26 

.00 

234 

.55 

234 

.48 

206 

.44 

206 


.29 
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CENSUS: 

TRAINING: 


EXPERIENCE: 


PERMANENT  ADDRESS: 

TYPIST: 


VITA 
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